(57) Abstract

The method of the invention provides for the formation of a recombinant polypeptide which has been modified at the C-terminal end
through the use of a transpeptidation process. The method is suitable for modifying recombinant polypeptides of any source including those
which may be commercially available, those derived from recombinant single copy or multicopy polypeptide constructs, or those derived
from single or multicopy recombinant fusion protein constructs. The transpeptidation reaction involves contacting an endopeptidase enzyme
with a recombinant polypeptide to substitute an addition unit, of one or more amino acids, for a leaving unit, linked to a core polypeptide
through a cleavage site recognized by the endopeptidase enzyme. Recombinant polypeptides derived from multicopy polypeptide constructs
may be cleaved from the multicopy polypeptide at the N-terminal and C-terminal ends and simultaneously undergo substitution of the
leaving unit by the desired addition unit. The invention utilizes known and newly discovered cleavage recognition sites to effectuate the
desired modification products.
ENZYMATIC METHOD FOR
MODIFICATION OF RECOMBINANT POLYPEPTIDES

Background of the Invention

Many naturally occurring proteins and peptides
have been produced by recombinant DNA techniques.
Recombinant DNA techniques have made possible the
selection, amplification and manipulation of expression
of the proteins and peptides. For example, changes in
the sequence of the recombinantly produced proteins or
peptides can be accomplished by altering the DNA
sequence by techniques like site-directed or deletion
mutagenesis.

However, some modifications to a recombinantly
produced protein or peptide cannot be accomplished by
altering the DNA sequence. For example, the C-terminal
α-carboxyl group in many naturally occurring proteins and
peptides often exists as an amide, but this amide
typically is not produced through recombinant expression
and is biologically converted after expression in vivo
from a precursor protein to the amide.

A method of forming a C-terminal amide on a
recombinantly produced polypeptide by the action of an
enzyme is known. The enzyme is peptidyl glycine
α-amidating monooxygenase and is present in eukaryotic
systems. The enzyme has been used to form an amide on
the C-terminal amino acid of recombinantly produced
peptides, like human growth hormone releasing hormone, in
vitro, as described by J. Engels, Protein Engineering,
1:195-199 (1987). While effective, the enzymatic method
is time consuming and expensive, gives unpredictable
yields, and requires significant post-reaction
purification. The enzymatic method is also limited to
modifying the recombinantly produced peptide by
C-terminal amidation.

Accordingly, there is a need for a chemical
method that provides for modification of C-terminal
α-carboxyl groups of a recombinantly produced
polypeptide. There is also a need for a method of
modification that allows addition of a variety of
moieties to the C-terminal α-carbon reactive groups of a
recombinantly produced polypeptide and that is
convenient, cheap and capable of producing terminally
modified recombinant polypeptides in high yield.
Therefore, it is an object of the invention to develop a
biochemical method for selective modification of the
C-terminal amino acid of a recombinantly produced
polypeptide. A further object is to provide a simple
and economic method for modification of the C-terminal
amino acid through a transpeptidation reaction.



Summary of the Invention

These and other objects are accomplished by the
present invention. The present invention is an
economical biochemical method for modification of the
C-terminal amino acid of recombinant polypeptides to
provide polypeptides which cannot normally be obtained
through recombinant technology.

The process of the invention utilizes
transpeptidation, which involves contacting an
endopeptidase enzyme, specific for an enzyme cleavage
site, with a recombinant polypeptide, composed of at
least one core linked by a cleavage site to a leaving
unit, in the presence of an addition unit. The
endopeptidase enzyme cleaves the leaving unit from the
core at the cleavage site and simultaneously causes the
core and the addition unit to form the desired modified
recombinant polypeptide. Alternatively, the cleavage of
the leaving unit and the formation of the linkage
between the core and addition unit may be completed in
two separate steps. Subsequent to transpeptidation,
further enzymatic modification of the terminal amino
acid carboxy group of the addition unit, through known
enzymatic methodology, is possible.
The endopeptidase enzymes used according to the
method of the invention include those of the serine or
cysteine peptidase class. The endopeptidase enzymes
trypsin and thrombin, of the serine peptidase class, are
especially desirable endopeptidase enzymes to serve as
cleavage enzymes for the method of the invention.

The recombinant polypeptide starting material
includes a core which may be a truncated version of its
natural form. The core may be truncated through
deletion of amino acids at either, or both, of its
C-terminal and N-terminal ends depending on the product
desired. The recombinant polypeptide also includes a
leaving unit linked to the core by an enzyme cleavage
site recognized by the endopeptidase enzyme. The
leaving unit may be one or more amino acid residues.

The amino acid cleavage site for the
endopeptidase enzyme may be recognized by the
endopeptidase enzyme in solo or as a part of a multiple
amino acid recognition sequence. In addition, according
to the method of the invention, cleavage sites which are
normally cleaved by an endopeptidase enzyme may be
rendered less reactive or unrecognizable when adjacent
to certain other amino acids. This knowledge is used
advantageously to make some cleavage sites less reactive,
rendering new and substantial utility to
endopeptidase enzymes which may otherwise be precluded
from use in certain transpeptidation reactions. The
ability to cause combination of the addition unit with
the core is a desirable characteristic of the
endopeptidase enzyme. The addition unit may be one or
more amino acid residues which may be modified at the
C-terminal α-carboxy at the time of transpeptidation, or
may be further treated by known enzymatic methodologies
subsequent to transpeptidation.
The entire transpeptidation process may be done
in a single step under very mild conditions. The
starting polypeptide of the invention may be a single-
[...] Cleaving and reacting the starting material can vary,
depending on the starting material used.

The recombinant multicopy polypeptide may be
multiple copies of the single copy polypeptide linked
together with or without intraconnecting peptides. If
an intraconnecting peptide is present, it has at least
one site that is selectively cleavable by the
endopeptidase cleavage enzyme. The intraconnecting
peptide may also serve as the leaving group from the
C-terminal end of a single copy core polypeptide.

The single copy polypeptides within a multicopy
polypeptide may be linked directly to each other through
[...]. It is preferred that a multicopy recombinant
polypeptide is cleaved into single copy core units and
simultaneously transpeptidated when in the presence of a
suitable addition unit. The downstream core acts as
leaving group for the core immediately preceding it.
The terminal single copy unit of a multicopy recombinant
polypeptide is linked to a suitable leaving unit so that
all single-copy polypeptides within the multicopy
recombinant polypeptide are transpeptidated according to
[...]

[...] construct has three tandemly-linked
segments including a binding protein connected
via an interconnecting peptide to a single copy or
multicopy polypeptide. The interconnecting peptide has
at least one site that is selectively cleavable by a
chemical or enzymatic method. The binding protein with
the interconnecting peptide acts as a biological
protecting group and aids in the purification of the
recombinant multicopy polypeptide.
Detailed Description of the Invention

Recombinant DNA techniques have made possible
the selection, amplification, and manipulation of
expression of many naturally occurring proteins and
peptides. It is often desirable to selectively modify a
recombinant polypeptide at the N-terminal α-amine and/or
C-terminal α-carboxyl groups. For example, the
C-terminal reactive carboxyl groups in some naturally
occurring proteins and peptides can be selectively
converted to an amide to provide for enhancement of
biological activity. Alternatively, a D-amino acid or
peptide could be added to replace a terminal amino acid.

These modifications can result in the formation
of analogs of the recombinantly produced polypeptide
that are longer acting and more potent than the
naturally occurring polypeptide. Generally, these types
of modifications to the recombinantly produced
polypeptide are not accomplished by alteration of the
DNA sequence for the recombinantly produced polypeptide
because there is no genetic code providing for amino
acid amides, or incorporation of a D-amino acid or an
amino acid derivative.

The present invention provides a process for
the selective modification of a recombinantly produced
polypeptide by a single-step transpeptidation process at
cleavage sites specific for various cleavage enzymes.
Alternatively, a two-step transpeptidation process may
be used whereby the polypeptide is first enzymatically
cleaved at the cleavage site to form the hydrolysis
product, which is then condensed with a suitable
addition unit to form the modified recombinant
polypeptide product.

The process allows for efficient modification
of recombinant polypeptides to produce products for
which there is no genetic code, for example, C-terminal
α-carboxyl amidation.
[...] process provides for modification of a
recombinant polypeptide through transpeptidation. [...]
purposes of this [...] acid or [...]
defined as that process whereby [...]
chain of amino [...] at the C-terminal end [...]
endopeptidase enzyme cleavage [...]
recombinant polypeptide [...]. [...]
method of the invention utilizes an endopeptidase
enzyme, preferably of the serine or cysteine class, [...]
the cleavage enzyme to catalyze transpeptidation.

The recombinant polypeptide includes a core
portion and a leaving unit. The core is any useful
polypeptide sequence such [...]
modified native sequence, a non-native sequence [...]
similar versions. [...] Preferably, the leaving unit is [...]
amino acid units. [...] linkage which [...]
cleavage enzyme. According to the method of the [...]
leaving unit [...] or multicopy fusion protein [...].

The recombinant polypeptide is contacted with
at least one endopeptidase cleavage enzyme specific for
at least one cleavage site. The enzymatic cleavage of
the recombinant polypeptide at the [...]
portion and the leaving unit is conducted in the
presence of an addition unit. An addition unit is a
[...] residue which may be
[...] C-terminal α-carbon. Alternatively,
modification of the C-terminal α-carbon end of the
addition unit may be done subsequent to the
transpeptidation process.

The method of the invention also provides for
cleavage of recombinant multicopy polypeptides into
single copy polypeptides by the endopeptidase cleavage
enzyme. Under the method of the invention, in the
presence of a suitable leaving unit, cleavage of the
multicopy polypeptide will occur simultaneously with
single-step transpeptidation. Alternatively, the
polypeptide may be cleaved, at the cleavage site, to
form the hydrolyzed cleaved polypeptide which
subsequently undergoes condensation with the addition
unit to form the modified recombinant polypeptide
product.

I. Transpeptidation

The method of the invention provides a modified
recombinant polypeptide product produced by
transpeptidation of a recombinant polypeptide. The
sequence and number of steps in the method of the
invention can be varied depending upon the desired
modification of the recombinant polypeptide, the amino
acid sequence of the desired product peptide, and the
starting material selected. The transpeptidation method
of the invention calls for the recombinant polypeptide
to be contacted with an endopeptidase cleavage enzyme,
which has specific cleavage activity at the linkage
between the core and the leaving unit.

The endopeptidase cleavage enzyme cleaves the
leaving unit from the carboxy terminal of the core of
the recombinant polypeptide. Although it is not
intended to be a limitation of the invention, it is
believed that during this cleavage, the enzyme forms an
acyl- or thioacyl-enzyme intermediate with the core. In
the presence of an appropriate addition unit, under
proper conditions, the enzyme causes the addition unit
to add to the cleaved core. Although it is not intended
to be a limitation of the invention, it is believed that
to accomplish this combination, the addition unit
displaces the cleavage enzyme from the acyl-enzyme
intermediate and links to the core polypeptide where the
leaving unit was linked. The production of the modified
recombinant polypeptide is monitored by HPLC or other
analytic procedure and the reaction is stopped by the
addition of an acidic solution when the reaction has
reached completion. The amino acid or terminal amino
acid residue of the addition unit may already be
modified at its carboxy terminal end at the time of
undergoing the transpeptidation reaction, such as by
modification of the C-terminus carboxylic acid to a
carboxamide, or, alternatively, be modified after
formation of the modified recombinant polypeptide.
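
The net result of the transpeptidation described above can be pictured as a substitution at the C-terminal end of the core. The following is a minimal sketch only, written in Python; the function name and the toy input are hypothetical and are not part of the disclosure, and the sketch does not model the acyl-enzyme intermediate, competing hydrolysis, or reaction kinetics.

```python
# Illustrative sketch only: transpeptidation modeled as a list operation on
# three-letter residue names. The names used here are hypothetical.

def transpeptidate(polypeptide, leaving_unit, addition_unit):
    """Replace the C-terminal leaving unit of a polypeptide with an addition unit.

    All arguments are lists of residue names. The core is whatever precedes
    the leaving unit in the starting polypeptide.
    """
    n = len(leaving_unit)
    if n == 0 or polypeptide[-n:] != leaving_unit:
        raise ValueError("polypeptide does not end with the leaving unit")
    core = polypeptide[:-n]          # what remains after cleavage
    return core + addition_unit      # core joined to the addition unit


# Toy input (an assumption for illustration): a short core ending in Lys,
# linked to a three-residue leaving unit.
start = "Trp-Leu-Val-Lys-Ala-Phe-Ala".split("-")
product = transpeptidate(start, ["Ala", "Phe", "Ala"], ["Gly", "Arg", "NH2"])
print("-".join(product))
```

In this picture the addition unit may already carry its C-terminal modification (here written as a trailing NH2 token), which corresponds to the case where the addition unit is modified before the reaction rather than afterwards.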

According to the method of the invention,
preferably, the cleavage site recognized by the cleavage
enzyme is a site not duplicated in the core or is not at
an enzyme accessible site within the core. The method
of the invention is also directed to an endopeptidase
enzyme cleavage of a multicopy recombinant polypeptide
into single copy recombinant polypeptides and
simultaneously transpeptidating the cores to form the
modified recombinant polypeptide in a single biochemical
reaction.

The invention is further directed to modified
enzyme cleavage sites which, when adjacent to certain
amino acid residues, render the site unrecognizable or
less reactive to cleavage. The discovery of the use of
these unrecognizable or less reactive sites to prevent
cleavage renders new and substantial utility to various
cleavage enzymes which would otherwise be precluded from
use in certain transpeptidation reactions due to the
detrimental effect of cleavage of recombinant
polypeptides at sites within the desired core.
The leaving units to be cleaved from the core
are specifically chosen to provide a suitable leaving
unit for the specific endopeptidase cleavage enzyme.
The addition units are chosen to provide the amino acid
or peptide chain to complete formation of the desired
modified recombinant polypeptide. The amino acid or
terminal amino acid of the addition unit may be modified
at the C-terminal α-carboxy or it may be modified after
transpeptidation. Alternatively, the addition unit may
be a peptidomimetic and serve as a linker between the
core and attached functional unit, as disclosed in
co-pending patent application Serial No.

The cleavage enzymes, according to the method
of the invention, include the class of endopeptidases.
The endopeptidases suitable for use in the present
invention include the serine and cysteine peptidases.
Although it is not intended to be a limitation of the
invention, the mechanism of action of serine and
cysteine endopeptidases is believed to involve the
formation of an acyl- or thioacyl-enzyme intermediate
with the core after cleaving the leaving unit. Under
appropriate reaction conditions, it is believed that the
addition unit acts as a nucleophile and displaces the
endopeptidase cleavage enzyme from the acyl- or
thioacyl-enzyme intermediate.

Serine peptidases are a group of animal,
plant and bacterial endopeptidases which have a
catalytically active serine residue in their active
center. Representative examples of endopeptidases of
the serine peptidase classification include trypsin,
thrombin, chymotrypsin, enterokinase, subtilisin, and
factor Xa. Representative examples of the cysteine
peptidase classification include ficin and papain.

The endopeptidase trypsin is found in the
pancreas of all vertebrates. It is released via the
pancreatic duct into the duodenum as trypsinogen.
Conversion of trypsinogen into trypsin is initiated in
the small intestine by enterokinase. Natural or
synthetic forms of trypsin are suitable for the method
of the invention.

Trypsin is known for its pronounced cleavage
site specificity, catalyzing hydrolysis of only the
carboxyl end of the -Lys-X and -Arg-X bonds. Trypsin's
affinity for cleavage at the -Lys-X bond is
significantly diminished when immediately adjacent to an
amino acid containing a carboxylic acid side chain,
specifically including the amino acids glutamic acid and
aspartic acid (i.e., X = glutamic or aspartic acid). A
discovery of the present invention utilizes the
knowledge of decreased cleavage activity at the -Lys-X
cleavage sites when X is an amino acid containing
a carboxylic acid
side chain (i.e., X = glutamic or aspartic acid). This
discovery has rendered the endopeptidase trypsin of
great utility in the formation of modified recombinant
polypeptides, according to the method of the invention.
Natural or synthetic forms of thrombin are suitable for
the method of the invention.

The glycoprotein endopeptidase thrombin, also
of the serine peptidase classification, is responsible
for the conversion of fibrinogen to fibrin. It is
naturally produced during blood coagulation by the
action of factor Xa upon prothrombin. This endopeptidase
has considerable sequence homology with trypsin and
contains the catalytically important residues His, Asp,
and Ser in the B chain. Thrombin has a cleavage
specificity for the carboxy end of the -Arg- cleavage
site in specific peptide sequences known as recognition
sequences.

Thrombin is known for its cleavage site
specificity at the carboxyl side of the -Arg- residue.
The known recognition sequence for the cleavage
site -Arg- is Gly-Pro-Arg-. A discovery of the present
invention is that thrombin also cleaves at the carboxyl
side of -Arg- within the recognition sequence of Gly-Ala-Arg.
This discovery enhances the use of thrombin for
[...] this invention as [...]
Ala-Arg recognition sequence will be [...]



Trypsin

[...] recombinant [...] multicopy constructs, or single or
multicopy fusion protein constructs.

[...]

The following description is based upon a
[...] for transpeptidation of [...]
of the invention is suitable [...]
polypeptides, regardless of [...].
The transpeptidation process of [...] buffer [...]
succinate, citrate, phosphate, [...]
HEPES, and the like. In one [...]
modified recombinant polypeptide [...] several
tissues and has been shown to be an incretin and is
commonly referred to as GLIP. The sequence of GLP1
(7-36)-NH2 (SEQ ID NO:1) is:
His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-
 7
Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-

Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-
        26  27
Val-Lys-Gly-Arg-NH2
    34      36

According to the invention, the trypsin
catalyzed transpeptidation [...]
substitution of the addition [...] unit,
is in competition with the trypsin catalyzed hydrolysis
at the carboxy terminus of the amino acid at the
[...]. There are two ways to affect the
[...] process over hydrolysis.

In the first variation, the recombinant
polypeptide GLP1 (7-34) core linked to the leaving unit
-Ala-Phe-Ala at a -Lys- cleavage site (SEQ ID NO:2):

His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-

Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-

Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-
        26  27
Val-Lys-Ala-Phe-Ala
    34

is dissolved in buffer. To the transpeptidation
[...] constant, Km, of the endopeptidase [...]
the recombinant polypeptide and the concentration of
recombinant polypeptide in the mixture. [...]
addition units for synthesis of the modified recombinant
polypeptide product GLIP (SEQ ID NO:1) and GLP1
(7-36)-Gly (SEQ ID NO:3),

His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-
 7
Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-

Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-
        26  27
Val-Lys-Gly-Arg-Gly
    34      36

respectively. The cleavage enzyme trypsin is [...]
an effective catalytic amount but not so great as to
cause substantial secondary [...] cleavage
at other sites, hydrolysis, and the like. The cleavage
enzyme trypsin is added to the [...]
trypsin:polypeptide molar ratio of about 1:10 to
1:[...]0,000, preferably 1:100 to 1:100,000, and more
preferably 1:200 to 1:50,000.
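
As a worked illustration of the molar ratios above, the following minimal Python sketch converts a trypsin:polypeptide molar ratio into an amount of enzyme. The function names and the example quantity of polypeptide are assumptions made for illustration; only the more preferred range of 1:200 to 1:50,000 is taken from the text.

```python
# Illustrative arithmetic only. A ratio of 1:N is represented by its
# denominator N (moles of polypeptide per mole of trypsin).

def trypsin_moles(polypeptide_moles, ratio_denominator):
    """Moles of trypsin for a trypsin:polypeptide molar ratio of 1:ratio_denominator."""
    if ratio_denominator <= 0:
        raise ValueError("ratio denominator must be positive")
    return polypeptide_moles / ratio_denominator


def trypsin_range(polypeptide_moles, low_denominator=200, high_denominator=50_000):
    """Return (minimum, maximum) moles of trypsin for the range 1:high to 1:low.

    The defaults correspond to the more preferred range of 1:200 to 1:50,000.
    """
    return (trypsin_moles(polypeptide_moles, high_denominator),
            trypsin_moles(polypeptide_moles, low_denominator))


# Hypothetical batch: 1 mmol of recombinant polypeptide.
minimum, maximum = trypsin_range(1e-3)
print(f"trypsin: {minimum:.1e} mol to {maximum:.1e} mol")
```

For the hypothetical 1 mmol batch this corresponds to roughly 20 nmol to 5 µmol of enzyme, which shows how small a catalytic amount the preferred ratios imply.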

The production of the modified recombinant
polypeptide product GLIP or GLP1 (7-36)-Gly is monitored
by HPLC, laser desorption mass spectrometry or other
analytical method, and the reaction stopped by the
addition of an acid solution. The reaction procedure
may be stopped by an acid solution at about pH 3.
Suitable acid solutions for stopping the [...]
include hydrochloric, sulfuric, acetic, and the like.

Alternatively, the trypsin catalyzed
competitive reactions of hydrolysis and transpeptidation
may be shifted in favor of transpeptidation through the
use of organic solvents. Suitable solvents for the
transpeptidation reaction, [...]
the invention, include DMSO and 75% v/v
N,N'-dimethylacetamide and 95% v/v [...]

[...] modified recombinant polypeptide
product requires an [...]
an addition [...] carboxyl group may be [...]
acid was used, [...] C-terminal α-carboxyl
[...] in a further [...]
group may be [...]
cited supra, for the GLP1 (7-36)-Gly by the use of the
C-terminal α-carboxyl amidating enzyme, as described in
[...] 1876-1877
[...] and are incorporated herein [...].

[...] modified recombinant polypeptide product is
purified from the [...]
chromatography, hydrophobic [...]
or particle exclusion chromatography.
[...] the separated product may be further
[...] contamination, [...] aforementioned methods.
[...] immediately or may be stored by [...]
cryopreservation at [...]

[...] endopeptidase cleavage
enzyme [...] the truncated GLP1 core from the
[...] cleavage site. (See [...]
bonds. It is noted that the GLP1 (7-3[...]) [...]
It is believed that an adjacent carboxylic
acid side chain containing amino acid adjacent to -Lys-
rendered the -Lys- cleavage site less reactive. The
method of the invention utilizes the knowledge that
-Lys- followed by an amino acid with a carboxylic acid
containing side chain is a poor substrate, and the
discovery that lysyl-glutamyl at 27-28 is not hydrolyzed
during the time required for complete hydrolysis of
lysyl-histidyl at 6-7 and lysyl-glycyl at 34-35 of GLP1
(1-37). For example, -Glu- renders the lysyl cleavage
site in -Lys-Glu- a poor substrate for trypsin. This
allows the serine peptidase trypsin to be utilized as a
cleavage enzyme when there exist multiple recognized
-Lys- cleavage sites, but only the desired cleavage site
is not adjacent to an amino acid containing a carboxyl
group containing side chain.
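
The site selection rule just described can be sketched as a scan over the residues of a starting polypeptide. The Python below is an illustrative sketch only: the function name and the labels "reactive" and "poor" are hypothetical, the rule is reduced to a yes/no decision although the text describes a difference in rate, and -Arg- sites are treated as reactive regardless of the following residue because the text states the carboxylic acid effect only for -Lys-X. The input is the GLP1 (7-34)-Ala-Phe-Ala sequence of SEQ ID NO:2.

```python
# Illustrative sketch of the -Lys-X / -Arg-X rule for trypsin.
ACIDIC = {"Glu", "Asp"}  # residues with a carboxylic acid side chain


def trypsin_sites(residues, first_position=1):
    """List candidate trypsin cleavage sites in a polypeptide.

    Returns tuples of (position, residue, next_residue, status), where status
    is "poor" for -Lys- followed by Glu or Asp and "reactive" otherwise.
    A C-terminal Lys or Arg has no following bond and is not reported.
    """
    sites = []
    for i, res in enumerate(residues[:-1]):
        if res not in ("Lys", "Arg"):
            continue
        nxt = residues[i + 1]
        status = "poor" if (res == "Lys" and nxt in ACIDIC) else "reactive"
        sites.append((first_position + i, res, nxt, status))
    return sites


seq2 = ("His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-Gly-Gln-"
        "Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-Ala-Phe-Ala").split("-")

for site in trypsin_sites(seq2, first_position=7):
    print(site)
```

With numbering starting at residue 7, the scan reports the -Lys-Glu- site at position 26 as a poor substrate and the -Lys-Ala- site at position 34 as the only reactive site, which is the site at which the leaving unit is attached.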

2. Trypsin Transpeptidation of a 

Single Copy Recombinant Polypeptide 
Derived from a Fusion Protein Construct 

The GLIP and GLP1 (7-36)-Gly modified
recombinant product polypeptides may be produced by the
method of the invention starting with a recombinant
polypeptide derived from a recombinant single copy
fusion protein starting product. As discussed infra, a
fusion protein construct serves as a carrier protein
system for recombinant polypeptides and provides an
efficient system for chromatographic purification. In
this variation, the fusion protein construct is first
purified from the other cell constituents, as described
below at section II. (See this section also for
definitions of the fusion protein terms.) Once the
fusion protein construct is purified from the other cell
constituents, preferably, the binding protein is
separated from the recombinant single copy polypeptide.
According to the method of the invention, the separation
of the binding protein from the recombinant single copy
polypeptide is accomplished by cleavage of the
interconnecting polypeptide or amino acid. Depending on
the interconnecting [...] peptide or amino acid used,
cleavage may be [...] of a cleavage
[...] chemical cleavage [...]. For example, the
[...] formic acid cleaves the [...] connecting amino acid
methionine. Once the single copy polypeptide is
released from [...]
the binding protein by [...] in the art such as
precipitation and [...] procedures including
size exclusion, [...] and the like. Once
[...] is transpeptidated
[...] invention, as described
above.

[...] simultaneous with the [...] be
[...] recombinant multicopy polypeptide is
incorporated into a [...]
limited only by the physical capabilities of the
specific expression system [...] expression of
[...] recombinant multicopy [...] recombinant
multicopy [...] multiple single copy
core polypeptides with intraconnecting peptides between
individual single copy core [...] single
[...] directly to one
another. In either alternative, the linkage between the
intraconnecting polypeptide [...] between
directly linked [...] is preferably a
cleavage [...] endopeptidase cleavage
enzyme. Further, [...] of a multicopy
polypeptide, only the [...] polypeptide
need be linked to a leaving unit. [...] non-terminal
single copy recombinant polypeptides, [...]
polypeptide acts as a leaving group for the immediately
preceding polypeptide.

According to the method of the invention, when
the recombinant multicopy polypeptide is composed of
single copy core polypeptides linked by intraconnecting
peptide, the potential peptides which may be used as
intraconnecting peptides only require that the terminal
ends are composed of amino acids which will not inhibit
the cleavage activity of the endopeptidase cleavage
enzyme at the linkage. For example, in one variation,
the 1-6 amino acid sequence of the unprocessed natural
form of GLP1 may serve as an intraconnecting polypeptide
between individual single copy GLP1 (7-34) units (SEQ ID
NO:4):

His-Asp-Glu-Phe-Glu-Arg-His-Ala
 1
Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser

Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala

Lys-Glu-Phe-Ile-Ala-Trp-Leu
26  27
Val-Lys
    34

In this embodiment, trypsin will cleave the recombinant
multicopy polypeptide into recombinant single copy
polypeptides at the -Arg- residue at amino acids 6-7 and
at the -Lys- residue at amino acids 34-1 to yield single
copies of core GLP1 (7-34) and intraconnecting units of
(SEQ ID NO:5):

His-Asp-Glu-Phe-Glu-Arg-His
 1                   6   7

The reaction will also yield a single terminal
Ala-Phe-Ala leaving group. When the reaction is
conducted in the presence of an appropriate nucleophilic
addition unit, such as Gly-Arg-NH2 or Gly-Arg-Gly,
transpeptidation occurs yielding the modified
recombinant polypeptide products GLIP and GLP1
(7-36)-Gly, respectively.
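
The simultaneous cleavage and transpeptidation of a multicopy construct can be pictured with the following Python sketch. It is a simplified model under stated assumptions: three copies of the SEQ ID NO:4 unit are joined in tandem and followed by an Ala-Phe-Ala leaving group (the copy number is arbitrary), trypsin is assumed to cut immediately after every -Arg- and after every -Lys- that is not followed by Glu or Asp, and every liberated core is assumed to be fully transpeptidated. The function and variable names are hypothetical.

```python
# Simplified model: split a multicopy construct at reactive trypsin sites,
# then attach the addition unit to each liberated GLP1 (7-34) core.
ACIDIC = {"Glu", "Asp"}

UNIT = ("His-Asp-Glu-Phe-Glu-Arg-His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-"
        "Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys").split("-")
LEAVING_GROUP = ["Ala", "Phe", "Ala"]
ADDITION_UNIT = ["Gly", "Arg", "NH2"]


def trypsin_fragments(residues):
    """Split after each Arg, and after each Lys not followed by Glu or Asp."""
    fragments, current = [], []
    for i, res in enumerate(residues):
        current.append(res)
        nxt = residues[i + 1] if i + 1 < len(residues) else None
        reactive = res == "Arg" or (res == "Lys" and nxt not in ACIDIC)
        if reactive and nxt is not None:
            fragments.append(current)
            current = []
    if current:
        fragments.append(current)
    return fragments


construct = UNIT * 3 + LEAVING_GROUP          # three copies, one leaving group
fragments = trypsin_fragments(construct)

# Cores are the 28-residue fragments (positions 7-34) ending in Lys.
cores = [f for f in fragments if len(f) == 28 and f[-1] == "Lys"]
products = [core + ADDITION_UNIT for core in cores]

print(len(fragments), "fragments,", len(cores), "cores")
print("-".join(products[0][-5:]))
```

In this model each downstream segment plays the role of the leaving group for the core that precedes it, so only the last copy needs the separate Ala-Phe-Ala unit.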

It is further recognized that the
interconnecting polypeptides may be cleaved before or
after transpeptidation by a chemical or enzymatic
cleavage agent. (See Table 2.)

According to the method of the invention, the
recombinant multicopy polypeptide is purified as
described in Scopes et al., Protein Purification:
Principles and Practice, Springer-Verlag, New York
[...], which is incorporated herein [...]. The
purified multicopy recombinant polypeptide is then
further processed according to the method of the
invention. As previously discussed for the recombinant
single copy polypeptide, the reaction is conducted,
preferably, in a buffered solution at pH 5-11. As
described earlier, the amount of [...]
is in the range of one equivalent up to 20 x Km of the
enzyme to the addition unit. Trypsin is [...] a
trypsin:polypeptide ratio of preferably [...]
1:[...],000. Simultaneous cleavage of the recombinant
multicopy polypeptide and transpeptidation yields
multiple copies of SEQ ID NO:5, multiple copies of
recombinant GLP1 (7-34) core, and one
[...]. The Gly-Arg-NH2 or Gly-Arg-Gly addition
[...] and transpeptidation occurs
at amino acid residue 34. The production of modified
recombinant polypeptide GLIP or GLP1 (7-36)-Gly product
is monitored by HPLC or other analytical technique and
the reaction stopped by the addition of a suitable acid
as described above.
In an alternative variation, the modified
recombinant polypeptide products may be [...]
method of the invention, through simultaneous cleavage
and transpeptidation of recombinant multicopy
polypeptide units composed of multiple single copy
polypeptide units connected without intervening
intraconnecting peptides. According to this variation
of the invention, for example, a recombinant multicopy
polypeptide of multiple single copy GLP1 (7-34) cores with
a terminal GLP1 (7-34) core linked to an Ala-Phe-Ala
leaving unit is expressed. The expressed multicopy
recombinant polypeptide is purified from cell
constituents, as previously described. The multicopy
construct is treated with the endopeptidase enzyme
trypsin, as previously described by the method of the
invention. Trypsin will cleave the multicopy
polypeptide into single copy polypeptides at the 34-7
-Lys- residue (see SEQ ID NO:3). Simultaneous with
cleavage, the single copy polypeptides will undergo
transpeptidation, as previously described, yielding the
GLIP or GLP1 (7-36)-Gly products in the presence of
Gly-Arg-NH2 or Gly-Arg-Gly addition units, respectively.

In another variation, it is further recognized
that the GLP1 (1-36)-NH2 and GLP1 (1-36)-Gly modified
recombinant product may be prepared, according to the
method of the invention, using mutant forms of trypsin.
In this variation, a multicopy recombinant polypeptide,
as previously described, is synthesized using multiple
single copy GLP1 (1-34) units (SEQ ID NO:6)

His-Asp-Glu-Phe-Glu-Arg-His-Ala
 1
Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser

Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala

Lys-Glu-Phe-Ile-Ala-Trp-Leu
26  27
Val-Lys
    34

contiguously connected without an intraconnecting
peptide and with a leaving group only at the terminal
polypeptide. The mutant trypsin endopeptidase enzymes
used for this variation have a decreased rate of
cleavage at the -Arg- site and a normal rate of cleavage
at the -Lys- cleavage site. These mutant forms will,
therefore, cleave the recombinant polypeptides at the
34 -Lys- residue, but will not cleave at the -Arg- 6
residue, yielding multiple single copy GLP1 (1-36) and a
single leaving group. In the presence of a suitable
addition unit such as Gly-Arg-NH2 or Gly-Arg-Gly, under
the conditions of the invention, the GLP1 (1-34) core
units will be transpeptidated yielding the GLP1
(1-36)-NH2 or GLP1 (1-36)-Gly products.

The foregoing transpeptidation processes
described for a multicopy recombinant polypeptide may
alternatively be conducted in organic solvents conducive
to the transpeptidation process, as described earlier.

4. Trypsin Transpeptidation of a
Multicopy Recombinant Polypeptide
Derived from a Fusion Protein Construct

The modified recombinant polypeptide products
may be produced, according to the method of the
invention, by transpeptidation of recombinant single
copy core polypeptide units which have been derived from
a multicopy polypeptide unit which has been derived from
a fusion protein construct. The number of recombinant
single copy core polypeptides included within the
recombinant multicopy polypeptide is limited only by the
physical capabilities of the chosen expression system.

The multicopy fusion protein construct is
formed, purified from the other cell constituents, and
the binding protein is separated from the recombinant
polypeptide, as described in Section II. The purified
recombinant multicopy polypeptide, separated from the
binding protein, is then further treated as described
above to yield the desired modified recombinant
polypeptide products.

B. Transpeptidation Using the Endopeptidase
Thrombin

Another example of an endopeptidase which may
act as a cleavage enzyme according to the method of the
invention is thrombin. As described earlier, thrombin
has a cleavage site preference at the carboxy end of
-Arg- (Y-Arg-X) within the known recognition sequence
Gly-Pro-Arg. A discovery of the present invention is
that thrombin also cleaves at the carboxy end of -Arg-
(Y-Arg-X) within the cleavage recognition sequence
Gly-Ala-Arg. The discovery of this recognition sequence
gives the endopeptidase enzyme thrombin new and
substantial utility in the preparation of modified
recombinant polypeptides by the method of this invention
and other recombinant methodologies. In the past,
recombinantly produced growth hormone releasing factor
(GRF) (1-44)-NH2 was produced through the use of an
α-amidating enzyme. By the method of the present
invention, the amidated form of GRF may be synthesized
through the use of an appropriate addition unit to a
core, or by amidation of an addition unit after
transpeptidation by the method of the invention.



1. Thrombin Transpeptidation of a
Single Copy Recombinant Polypeptide

The transpeptidation process of the present
invention, utilizing the endopeptidase enzyme thrombin,
is a one-step reaction. As discussed earlier for
trypsin, conditions are maintained to favor
transpeptidation over the competing reaction of hydrolysis.
Within an aqueous environment, the reaction is conducted
in a buffer solution capable of maintaining pH at about
pH 2-13, preferably pH 3-12, and more preferably pH
5-11. Suitable buffers for the present invention are as
previously described for trypsin. Using the serine
endopeptidase thrombin as a cleavage enzyme for cleavage
and transpeptidation, the recombinant polypeptide
includes a GRF (1-41) core linked to a leaving unit. A
known leaving unit is Ala-Arg-Leu-Ala. It is recognized
that there are many potential leaving units, including
peptides and single amino acids such as -Ala-. The
sequence of GRF (1-41) (SEQ ID NO:7) is:



Tyr-Ala-Asp-Ala-Ile-Phe-Thr-Asn-Ser-Tyr-Arg-Lys-
1
Val-Leu-Gly-Gln-Leu-Ser-Ala-Arg-Lys-Leu-Leu-Gln-
13
Asp-Ile-Met-Ser-Arg-Gln-Gln-Gly-Glu-Ser-Asn-Gln-
25
Glu-Arg-Gly-Ala-Arg
37              41

A suitable addition unit for synthesis of GRF (1-44)-NH2
is Ala-Arg-Leu-NH2, and for synthesis of GRF (1-44)-Gly,
a suitable addition unit is Ala-Arg-Leu-Gly (SEQ ID
NO:8). The present variation uses the discovery that
thrombin recognizes the cleavage site -Arg- within a
Gly-Ala-Arg recognition sequence.
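
As an illustration of how the Gly-Ala-Arg recognition sequence determines the cleavage point, the following sketch locates the motif in GRF (1-41)-Ala-Arg-Leu-Ala, splits the substrate after the -Arg-, and substitutes an addition unit for the leaving unit. The helper names `find_cleavage_index` and `transpeptidate` are hypothetical, and the model assumes that only the Gly-Ala-Arg motif is recognized; it is a bookkeeping sketch, not a description of thrombin's actual specificity.

```python
# GRF (1-41), residues as listed for SEQ ID NO:7 in the text.
GRF_1_41 = (
    "Tyr-Ala-Asp-Ala-Ile-Phe-Thr-Asn-Ser-Tyr-Arg-Lys-"
    "Val-Leu-Gly-Gln-Leu-Ser-Ala-Arg-Lys-Leu-Leu-Gln-"
    "Asp-Ile-Met-Ser-Arg-Gln-Gln-Gly-Glu-Ser-Asn-Gln-"
    "Glu-Arg-Gly-Ala-Arg"
).split("-")
RECOGNITION = ["Gly", "Ala", "Arg"]


def find_cleavage_index(residues, motif=RECOGNITION):
    """Return the index just past the first occurrence of `motif`, or None."""
    m = len(motif)
    for i in range(len(residues) - m + 1):
        if residues[i:i + m] == motif:
            return i + m
    return None


def transpeptidate(substrate, addition_unit):
    """Replace everything after the cleavage point with the addition unit."""
    cut = find_cleavage_index(substrate)
    if cut is None:
        raise ValueError("no Gly-Ala-Arg recognition sequence found")
    core, leaving_unit = substrate[:cut], substrate[cut:]
    return core + addition_unit, leaving_unit


substrate = GRF_1_41 + ["Ala", "Arg", "Leu", "Ala"]
product, leaving_unit = transpeptidate(substrate, ["Ala", "Arg", "Leu", "NH2"])
print("-".join(product[-5:]))   # C-terminal end of the product
print("-".join(leaving_unit))   # released leaving unit
```

The cleavage index falls after residue 41, so the product carries the 41-residue core followed by Ala-Arg-Leu-NH2.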

This knowledge is used to cleave the
Ala-Arg-Leu-Ala leaving unit from the core at the -Arg-
within Gly-Ala-Arg. To the recombinant polypeptide GRF
(1-41)-Ala-Arg-Leu-Ala is added the addition unit in an
amount of about one equivalent to 20 times the Km of the
addition unit to the acyl-enzyme intermediate,
preferably 10 x Km of the addition unit to the
acyl-enzyme intermediate. For example, Ala-Arg-Leu-NH2 or
Ala-Arg-Leu-Gly (SEQ ID NO:8) are suitable addition
units for synthesis of the modified recombinant GRF
(1-44)-NH2 and GRF (1-44)-Gly products. The cleavage
enzyme thrombin is added to the mixture in a
thrombin:polypeptide ratio of about 1:10 to 1:500,000,
preferably 1:100 to 1:100,000, and more preferably 1:200
to 1:50,000.
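
The quantities above reduce to simple arithmetic, sketched below. The text does not say whether the thrombin:polypeptide ratio is by mass or by mole, so the sketch only assumes both amounts share one unit. The Km value used in the example is a placeholder, not a measured constant, and both function names are hypothetical.

```python
def enzyme_amount(polypeptide_amount, ratio_denominator):
    """Enzyme amount for a 1:ratio_denominator enzyme:polypeptide ratio.

    Assumption: enzyme and polypeptide are expressed in the same unit.
    """
    if ratio_denominator <= 0:
        raise ValueError("ratio denominator must be positive")
    return polypeptide_amount / ratio_denominator


def addition_unit_concentration(km, multiple=10.0):
    """Addition unit concentration as a multiple of Km (up to 20 x Km)."""
    if not 0 < multiple <= 20:
        raise ValueError("multiple of Km should be above 0 and at most 20")
    return km * multiple


# 100 units of polypeptide at the more preferred bounds 1:200 and 1:50,000.
print(enzyme_amount(100.0, 200))
print(enzyme_amount(100.0, 50000))

# Placeholder Km of 1.0 (arbitrary concentration unit) at the preferred 10 x Km.
print(addition_unit_concentration(1.0))
```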

The production of GRF (1-44)-NH2 is monitored by
HPLC or other appropriate analytical technique and the
reaction stopped by the addition of an enzyme inhibitor
such as phenyl methane sulfonyl fluoride (PMSF) or
diisopropyl phosphoryl fluoridate (DPF). The modified
recombinant polypeptide is separated from the reaction
mixture by reverse phase chromatography, hydrophobic
interaction chromatography, ion exchange chromatography,
or HPLC. The recombinant polypeptide product may be
stored at about -20°C to about -80°C after
lyophilization.

Alternatively, the thrombin catalyzed
competitive reactions of hydrolysis and
transpeptidation may be shifted in favor of
transpeptidation through the use of organic solvents.
Suitable solvents for the transpeptidation reaction,
according to the method of the invention, include DMSO
and 75% v:v N,N'-dimethylacetamide and 95% v:v. Bongers
et al., cited supra.

2. Thrombin Transpeptidation of a Single Copy
Recombinant Polypeptide Derived from a
Fusion Protein Construct

Recombinant GRF (1-44)-NH2 can be prepared,
according to the method of the invention, from a
recombinant polypeptide derived from a single copy
fusion protein construct. The expression of the single
copy fusion protein construct is described infra. In
brief, the binding protein of the fusion protein
construct will be connected to the single copy
recombinant polypeptide through an interconnecting
peptide. The interconnecting peptide may be a single
amino acid which is cleavable by a chemical cleavage
agent or a peptide which terminates with an amino acid
sequence recognizable by a cleavage enzyme. For
example, the tetrapeptide Asn-Gly-Pro-Arg (SEQ ID NO:9)
is a suitable interconnecting peptide for the fusion
protein construct containing the GRF (1-41)-Ala-Arg-Leu-Ala
single copy recombinant polypeptide.

Once expressed, the fusion protein construct is
purified from other cell constituents, and the single
copy recombinant polypeptide is then separated from the
binding protein, as described in copending U.S. patent
application Serial No. 07/552,810, the disclosure of
which is incorporated herein by reference. For example,
a human carbonic anhydrase fusion protein may be
separated from the interconnecting peptide of the
sequence Asn-Gly-Pro-Arg (SEQ ID NO:9) through the use
of guanidine hydrochloride.

The cleaved single copy recombinant
GRF (1-41)-Ala-Arg-Leu-Ala peptide can be separated from the
binding protein by normal chromatographic methods
including ion exchange, reverse phase, and size
exclusion. Alternatively, the recombinant single copy
polypeptide may be separated from the carrier protein by
standard precipitation methods. The purified single
copy recombinant GRF (1-41)-Ala-Arg-Leu-Ala is then
treated according to the method of the invention, as
previously described, in the presence of an
Ala-Arg-Leu-NH2 or Ala-Arg-Leu-Gly (SEQ ID NO:8) addition
unit to yield the modified recombinant products GRF
(1-44)-NH2 and GRF (1-44)-Gly.

3. Thrombin Transpeptidation of a
Multicopy Recombinant Polypeptide

In a third variation, according to a method of
the invention, thrombin may be used to simultaneously
cleave and transpeptidate a recombinant multicopy
polypeptide to form the desired modified recombinant
polypeptide product. In this variation, the recombinant
multicopy polypeptide is produced by methods discussed
in Section II. The multiple single copy recombinant GRF
(1-41) cores are linked together without the use of an



WO 95/03405    PCT/US94/08125



intraconnecting peptide. The terminal GRF (1-41) core
is linked to an -Ala- leaving unit. The GRF (41-1)
linkages are prefixed by the thrombin recognition
sequence -Gly-Ala-Arg- and cleavage occurs at the -Arg41-
carboxy group (see SEQ ID NO:7).

The recombinant multicopy polypeptide is
purified, as discussed in Section II. The number of single
copy polypeptides which may be linked within the
recombinant multicopy polypeptide is limited only by the
physical capabilities of the expression system. The
recombinant multicopy polypeptide is added to buffer
solution with thrombin and an addition unit, as
described above. Also as described above, the reaction
may be conducted in organic solvents to favor the
transpeptidation reaction.

In this variation, the thrombin recognition
site of -Gly39-Ala40-Arg41- is also utilized to facilitate
the cleavage of GRF (1-41)-Ala-Arg-Leu-Ala at the
-Arg41-Ala42-Arg43-Leu44-Ala45 linkage of the terminal
single copy recombinant polypeptide. In the presence of
a suitable nucleophile such as Ala-Arg-Leu-NH2 or
Ala-Arg-Leu-Gly (SEQ ID NO:8), the desired modified
recombinant polypeptide products are produced through
transpeptidation simultaneous with cleavage of the
multicopy recombinant polypeptide.
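
A minimal sketch of this multicopy case follows. It splits a head-to-tail GRF (1-41) concatemer after every Gly-Ala-Arg motif and then attaches the nucleophile to each released core. The function name `split_after_motif` is hypothetical, and the model assumes cleavage only at the Gly-Ala-Arg sites named in the text.

```python
GRF_1_41 = (
    "Tyr-Ala-Asp-Ala-Ile-Phe-Thr-Asn-Ser-Tyr-Arg-Lys-"
    "Val-Leu-Gly-Gln-Leu-Ser-Ala-Arg-Lys-Leu-Leu-Gln-"
    "Asp-Ile-Met-Ser-Arg-Gln-Gln-Gly-Glu-Ser-Asn-Gln-"
    "Glu-Arg-Gly-Ala-Arg"
).split("-")
MOTIF = ["Gly", "Ala", "Arg"]
LEAVING_UNIT = ["Ala", "Arg", "Leu", "Ala"]


def split_after_motif(residues, motif=MOTIF):
    """Split a residue list after every occurrence of `motif`."""
    fragments, start, i, m = [], 0, 0, len(motif)
    while i <= len(residues) - m:
        if residues[i:i + m] == motif:
            fragments.append(residues[start:i + m])
            start = i + m
            i += m
        else:
            i += 1
    if start < len(residues):
        fragments.append(residues[start:])  # trailing leaving unit
    return fragments


multicopy = GRF_1_41 * 3 + LEAVING_UNIT
fragments = split_after_motif(multicopy)
cores = [f for f in fragments if f == GRF_1_41]

# Each released core accepts the nucleophile in place of what followed it.
products = ["-".join(core + ["Ala", "Arg", "Leu", "NH2"]) for core in cores]
print(len(cores), "cores released;", "-".join(fragments[-1]), "leaving unit")
```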

4. Thrombin Transpeptidation of a Multicopy
Recombinant Polypeptide Derived from a
Fusion Protein Construct

The modified recombinant multicopy polypeptide
products may be produced, according to the method of the
invention, by transpeptidation of recombinant single copy
core polypeptide units which have been derived from a
multicopy polypeptide unit which has been derived from a
fusion protein construct. The number of recombinant
single copy core polypeptides included within the
recombinant multicopy fusion protein construct is
limited only by the physical capabilities of the
expression system. Purification of the recombinant
multicopy polypeptide from the fusion protein
construct is as previously described. The purified
recombinant multicopy polypeptide is separated from the
fusion protein construct and then treated with thrombin
endopeptidase enzyme, as described above, to yield the
desired modified recombinant polypeptide product.

The multicopy fusion protein construct is
prepared as described in Section II.

II. Forming the Recombinant Single- or
Multicopy Polypeptide and the Single- or
Multicopy Recombinant Fusion Protein Construct

The recombinant single- or multicopy
polypeptide or the single- or multicopy recombinant
fusion protein construct is formed by recombinant DNA
methods disclosed in U.S. application Serial No.
07/552,810, the disclosure of which is incorporated
herein by reference. The gene sequence for the desired
recombinant polypeptide can be cloned or, in the case of
a smaller peptide, synthesized by automated synthesis.
The gene sequence encoding the leaving unit is linked at
the C-terminal end of the core polypeptide.

For conciseness, the term "fusion protein
construct" will be used to refer to either the single-
or multicopy recombinant fusion protein. The term
"polypeptide construct" will be used to generically
refer to the recombinant single- or multicopy
polypeptide.

The expression vector containing the
recombinant gene for a polypeptide construct or fusion
protein construct is capable of directing expression of
the recombinant gene in prokaryotic or eukaryotic cells.
The expression vector incorporates the recombinant gene
and base vector segments such as the appropriate
regulatory DNA sequences for transcription, translation,
phenotyping, temporal or other control of expression,
RNA binding and post-expression manipulation of the
expressed product. The expression vector generally will
include structural features such as a promoter, an
operator, a regulatory sequence and a transcription
termination signal. The expression vector can be
synthesized from any base vector that is compatible with
the host cell or higher organism and will provide the
foregoing features. The regulatory sequences of the
expression vector will be specifically compatible or
adapted in some fashion to be compatible with
prokaryotic or eukaryotic host cells or higher
organisms. Post-expression regulatory sequences, which
cause secretion of the polypeptide construct, can be
included in the eukaryotic expression vector. It is
especially preferred that the expression vector exhibit
a stimulatory effect upon the host cell or higher
organism such that the polypeptide construct is
overproduced relative to the usual biosynthetic
expression of the host.

Transformed prokaryotic or eukaryotic cells or
higher organisms carrying the appropriate recombinant
prokaryotic or eukaryotic vectors constitute the
transformed cells of this invention. The prokaryotic
cells useful as hosts include any that are amenable to
expression of foreign protein. Preferred embodiments
include E. coli and B. subtilis. The eukaryotic cells
include unicellular organisms, such as yeast cells, as
well as immortal cells from higher organisms, such as
plant, insect or mammalian cells. Preferred eukaryotic
cells include Saccharomyces cerevisiae, Pichia pastoris,
Aspergillus niger, Spodoptera frugiperda, and corn,
tobacco or soybean plant cells. The higher organisms
useful as hosts include higher order plants and animals
having germ cells that are amenable to transformation.
Included are plants such as tobacco, corn, soybean and
fruit bearing plants, and invertebrate and vertebrate
animals such as fish, birds and mammals, especially
including sheep, goats, cows, horses and pigs.

The invention as well includes a cultured,
transformed cell or transformed plants or animals that
are capable of expressing the fusion protein or
polypeptide construct composed of a core and at least one
leaving unit, wherein the leaving unit is linked to the
core by an enzyme cleavage site and may be substituted
by an addition unit when transpeptidated by an
endopeptidase cleavage enzyme.

The expression steps of the method according to
the present invention are based upon microbial or higher
organism protein expression. The steps call for
inserting the recombinant gene into an appropriate base
vector, transforming host cells or higher organisms with
the resulting recombinant vector and expressing the
polypeptide construct or fusion protein construct,
preferably as a soluble product within the host cell or
higher organism, as a product that is insoluble in the
cell cytoplasm, or as a secreted product by the host
cell or higher organism. When higher organisms are
chosen as the host, fertilized germ cells of that
organism are transformed and the transformed organism
grown through usual maturation techniques.

The purification steps of a polypeptide
construct call for separating the polypeptide construct
from other cellular constituents, debris, and culture
medium. The purification steps of a fusion protein
construct call for affinity binding of the fusion
protein construct to an immobilized ligand, and
separating it from other cellular constituents, debris
and culture medium. The polypeptide portion of the
fusion protein construct is obtained from the
immobilized fusion protein construct through enzymatic
or chemical cleavage action upon the interconnecting
peptide, and separating the variable fused polypeptide
from the cleavage enzyme or other material. (Throughout
this application, mention of enzymatic or chemical
cleavage alone will be understood to include both.)

Alternatively, the purification steps can
separate the entire fusion protein construct from the
immobilized ligand after purification and cleave it with
an immobilized cleavage enzyme or chemical reagent to
produce a mixture containing the variable fused
polypeptide and binding protein. This mixture can be
separated by use of an immobilized ligand for the
binding protein and removal of the purified polypeptide
construct.

Preferred embodiments of the method include
those expressing the recombinant gene composed of DNA
segments for human carbonic anhydrase (or a modified
functional version thereof), interconnecting peptide and
a recombinant single copy polypeptide or multiple units
thereof. Additional preferred embodiments include use
of E. coli or yeast as the host cells and use of
controlled expression by means of any induction system
such as temperature, nutrients, isopropyl
thiogalactoside, indole acrylic acid, carbon sources and
the like, so as to allow the production of a protein
purification construct that would be toxic to the host.
Further preferred embodiments include use of an
expression vector system for prokaryotic cells which
incorporates a two plasmid construction, and an
expression vector system for yeast cells which
incorporates a shuttle vector with an origin of
replication for E. coli and one for S. cerevisiae.

Due to E. coli digestion of single copy
recombinant polypeptide expressed intracellularly,
incorporation into a fusion protein construct is
required. The possibility of an E. coli organism which
does not degrade intracellularly expressed single copy
recombinant polypeptides not attached to a carrier
protein is recognized according to the method of the
invention.

A. Recombinant Polypeptide Production
from a Fusion Protein Construct

1. Method for Expression of Host Cells
of Fusion Protein Construct

The methods for expression of single- and
multicopy recombinant fusion protein products are disclosed
in U.S. patent application Serial No. 07/552,810, filed
July 16, 1990, the disclosure of which is incorporated
herein by reference.

As discussed in U.S. Serial No. 07/552,810, the
use of multicopy or single copy recombinant fusion
proteins allows for the highly efficient purification of
recombinant polypeptides. The construct of a
recombinant fusion protein has three tandem segments.
The first segment is a binding protein which exhibits
strong, reversible binding to a specific small molecular
weight ligand. The second segment is an interconnecting
peptide which is selectively cleavable by an enzyme or
chemical technique. The interconnecting peptide
connects the binding protein to the N- or C-terminal end
of the recombinant single copy or multicopy polypeptide.
It is typically a short chain peptide. It is preferred
to construct the fusion protein construct gene so that
the binding protein gene fragment is read first. The
third segment, the variable fused polypeptide,
incorporates any natural or synthetic polypeptide
desired as a starting product for the method of the
invention.

2. Method of Purification of
Fusion Protein Construct

The recombinant single or multicopy polypeptide
produced as a fusion protein allows for easy
purification by affinity chromatography. The fusion
protein produced in the transformed cells can be soluble
in the cells or insoluble in inclusion bodies. Soluble
fusion protein construct is obtained by lysis of the
transformed cells to form a crude cell lysate. The
crude cell lysate can be further purified by methods
including ultrafiltration and ion exchange
chromatography before purification by affinity
chromatography. Insoluble fusion protein in inclusion
bodies is also purified by similar methods.

To perform affinity purification, the crude
mixture of materials is combined with an immobilized
ligand for the binding protein. Examples of the binding
protein, corresponding ligand and dissociation constants
are given in Table 1. A complete discussion of the
method of purification of the fusion protein construct
is found in copending application Serial No. 07/552,810,
the disclosure of which is incorporated herein by
reference.

TABLE 1

Binding Protein                     Ligand                                  Kd          Ref.

Xanthine oxidase                    Allopurinol                             strong      1
Adenosine deaminase                 Coformycin                              <1.2E-10    1
Adenosine deaminase                 Deoxycoformycin                         2.5E-12     2
Adenosine deaminase                 erythro-9-(2-hydroxy-3-nonyl)adenine    1.6E-9      2
Dihydrofolate reductase             Methotrexate                            1.2E-9      4
Dihydrofolate reductase             Methotrexate                            2.3E-9      5
Dihydrofolate reductase             [illegible]                             3.7E-9      5
Dihydrofolate reductase             Trimethoprim                            4.6E-9      5
Ribulose bisphosphate carboxylase   2-carboxyarabinitol 1,5-bisphosphate    1E-14       6
Pepsin                              Pepstatin                               10E-9
Calmodulin                          Melittin                                3E-9        7
Calmodulin                          Various peptides                        0.2E-9      7
Cholesterol esterase                Borinic acid                            0.1E-9      8
Carbonic anhydrase II               Sulfanilamide                           4.6E-7      3
Carbonic anhydrase II               Acetazolamide                           6E-10       3

E is times ten to the negative exponent indicated.

References Cited in Table 1
1. Cha et al., Biochemical Pharm., 24, 2187-2197.
2. Agarwal et al., Biochemical Pharm., 26, 354-367 (1977).
3. Taylor, P.W. et al., Biochemistry, 9, 2638 (1970).
4. Cha et al., Biochemical Pharm., 30, 1507-1515 (1981).
5. Williams et al., Biochemical Pharm., 29, 589-595 (1980).
6. Pierce, J., Tolbert, N.E., Barker, R., Biochem., 19:934-942 (1980).
7. Degrado et al., J. of Cellular Biochem., 29, 83-93 (1989).
8. Sutton et al., BBRC, 134, 386-392 (1986).
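
The "E" notation used for the dissociation constants in Table 1 maps directly onto floating point exponent notation, as the short sketch below shows. It parses a few of the legible entries and ranks the pairs from tightest to weakest binding. The helper `parse_kd` is hypothetical, and the assumption that the values are molar is not stated in the table.

```python
def parse_kd(text):
    """Convert a Table 1 style entry such as '1.2E-9' or '<1.2E-10' to a float.

    A leading '<' (upper bound) is dropped, so the bound itself is returned.
    """
    return float(text.strip().lstrip("<"))


# A subset of the legible (binding protein, ligand) entries from Table 1.
TABLE_1 = {
    ("Adenosine deaminase", "Coformycin"): "<1.2E-10",
    ("Dihydrofolate reductase", "Methotrexate"): "1.2E-9",
    ("Ribulose bisphosphate carboxylase",
     "2-carboxyarabinitol 1,5-bisphosphate"): "1E-14",
    ("Carbonic anhydrase II", "Sulfanilamide"): "4.6E-7",
    ("Carbonic anhydrase II", "Acetazolamide"): "6E-10",
}

# Smaller Kd means tighter binding, so sort ascending.
ranked = sorted(TABLE_1, key=lambda pair: parse_kd(TABLE_1[pair]))
for protein, ligand in ranked:
    print(f"{protein} / {ligand}: {parse_kd(TABLE_1[(protein, ligand)]):.1e}")
```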






For the preferred carbonic anhydrase enzyme,
the ligand is sulfanilamide or a benzene sulfonamide
derivative. Immobilization of the ligand on a solid
support can be accomplished by the methods of W.
Scouten, Methods Enzymol., 34, 288-294 (1974); S.
Marcus, Methods Enzymol., 34, 377-385 (1974); A. Matsura
et al., Methods Enzymol., 34, 303-4 (1974); R. Barker,
Methods Enzymol., 34, 317-328 (1974); I. Matsumoto,
Methods Enzymol., 34, 324-341 (1974); J. Johansen,
[illegible] Res. Commun., 14, 73 (1976); and G. S. Bethell
et al., J. Biol. Chem., 254, 2572-2574 (1979); the
disclosures of which are incorporated herein by
reference. The fusion protein binds to the immobilized
ligand through the reversible affinity of the binding
protein for its ligand. The remaining constituents and
debris of the crude mixture of materials can then be
removed by washing or similar techniques.

Two routes can be employed for further
purification of the fusion protein. According to the
first route, the single or multicopy fusion protein is
dissociated intact from the immobilized ligand by
washing with a strong competing ligand solution.
Examples include cyanides, pseudocyanides such as
thiocyanides, perchlorates, halide and similar strong
Lewis bases.

According to the second route, the immobilized
single or multicopy fusion protein is contacted directly
with cleavage reagent to release the single or multicopy
polypeptide. To isolate the single or multicopy
polypeptide in the second route, its mixture with
cleavage enzyme can be combined with a means for
molecular weight selection (e.g. partition
chromatography, dialysis, filtration based on molecular
size or high pressure liquid chromatography on a
"particle exclusion" base or ion exchange
chromatography) such that the high molecular weight
cleavage enzyme is separated from the free variable
[...]

[...] enzymes and their cleavage sites are given in Table 2.

TABLE 2

Enzymes for Cleavage           DNA Seq.

Enterokinase                   GACGACGACGATAAA (SEQ ID NO:10)
Factor Xa                      ATTGAAGGAAGA (SEQ ID NO:11)
Thrombin                       AGAGGACCAAGA (SEQ ID NO:12)
Ubiquitin Cleaving Enzyme      AGAGGAGGA (SEQ ID NO:13)
Renin                          CATCCTTTTCATCTGCTGGTTTAT (SEQ ID NO:14)
Trypsin                        AAA or CGT
Chymotrypsin                   TTT or TAT
Clostripain                    CGT
S. aureus V8                   GAA

Chemical Cleavage              DNA Seq.

(at pH3)                       GATGGA
(Hydroxylamine)                AATCCA
(CNBr)                         ATG
BNPS-skatole                   TGG
2-Nitro-5-thiocyanobenzoate    TGT
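
The DNA sequences listed for the enzymatic cleavage sites in Table 2 can be read as the peptide recognition sequences they encode by translating them codon by codon with the standard genetic code. The sketch below does this for the five enzymes with SEQ ID numbers; the codon dictionary is deliberately limited to the codons that appear in those sequences, and the function name `translate` is hypothetical.

```python
# Standard genetic code, restricted to the codons used in the Table 2 entries below.
CODONS = {
    "GAC": "Asp", "GAT": "Asp", "AAA": "Lys", "ATT": "Ile", "GAA": "Glu",
    "GGA": "Gly", "AGA": "Arg", "CGT": "Arg", "CCA": "Pro", "CCT": "Pro",
    "CAT": "His", "TTT": "Phe", "CTG": "Leu", "GTT": "Val", "TAT": "Tyr",
}


def translate(dna):
    """Translate a DNA string, read in frame from its first base."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "-".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))


TABLE_2 = {
    "Enterokinase": "GACGACGACGATAAA",
    "Factor Xa": "ATTGAAGGAAGA",
    "Thrombin": "AGAGGACCAAGA",
    "Ubiquitin Cleaving Enzyme": "AGAGGAGGA",
    "Renin": "CATCCTTTTCATCTGCTGGTTTAT",
}

for enzyme, dna in TABLE_2.items():
    print(f"{enzyme}: {translate(dna)}")
```

The thrombin entry translates to Arg-Gly-Pro-Arg, which contains the Gly-Pro-Arg recognition sequence discussed earlier in the text.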



The purification methods described above yield
the starting materials for the method of the invention:
a single copy recombinant fusion protein, a multicopy
recombinant fusion protein, a single copy recombinant
polypeptide, or a multicopy recombinant polypeptide. In
a preferred embodiment, the recombinant single and
multicopy polypeptides are produced from a fusion
protein.

B. Recombinant Polypeptide Production from
a Recombinant Polypeptide

1. Recombinant Method for Expression of
Host Cells of Multicopy Polypeptide

The methods for expression of single- and
multicopy recombinant polypeptides, i.e. a polypeptide
expressed with a leader sequence, a limiting protein or
an affinity moiety attached to it, are known in the art
and described in Protein Purification: From Mechanisms
to Large-Scale Processes, Michael Ladisch, editor;
American Chemical Society, publisher (1990), the
disclosure of which is incorporated herein by reference.

2. Method of Purification of
Recombinant Multicopy Polypeptide

The method for purification of a recombinant
multicopy polypeptide is known in the art and is
described in Kirshner et al., J. Biotechnology, 12:247-260
(1989), the disclosure of which is incorporated
herein by reference.

III. Therapeutic Use of Recombinant Modified Polypeptide
[illegible] Produced by the Method of the Invention

The products of the present invention have
significant therapeutic and supplemental physiological
uses in clinical human and veterinary medical practice.
For example, the insulinotrophic activity of GLP1
(7-36)-NH2 has been shown to be beneficial in treating the
[illegible] (NIDDM, Type II). Gutniak, [illegible] for
[illegible] and osteoporosis. In livestock [illegible]
used to [illegible] meat products.

[illegible] pharmaceutical [illegible]
functional compositions of the [illegible]
acceptable carrier, administered in an
[illegible]
dependent on purpose of use and the
condition of the recipient.

EXAMPLE 1

The naturally occurring sequence of Glucagon
Like Peptide 1 (GLP1) (SEQ ID NO:15) is:

His-Asp-Glu-Phe-Glu-Arg-His-Ala-
Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-
Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-
Lys-Glu-Phe-Ile-Ala-Trp-Leu-
26  27
Val-Lys-Gly-Arg-NH2
    34      36

A GLP1 peptide is a 36 amino acid peptide that
has been recombinantly produced but without a mechanism
for providing for the amidation of the C-terminal
arginine residue. In this example, the method of the
invention has been designed to produce a single copy
fusion protein construct containing one copy of a gene
encoding a truncated core GLP1 and amidating the core
GLP1 by a transpeptidation reaction, using the
endopeptidase trypsin, to form a modified recombinant
GLP1 polypeptide.

The strategy involves forming a DNA construct
encoding a single copy recombinant fusion protein. The
single copy fusion protein includes at least three
segments. The first segment is a binding protein which
exhibits strong reversible binding to a specific small
molecular weight ligand. The second segment is an
interconnecting peptide which is selectively cleavable
by an enzyme or chemical technique. The third segment
is a variable fused peptide containing one copy of the
desired natural or synthetic polypeptide, in this case
GLP1 (7-34). The formation of a DNA construct for the
fusion protein, as well as the fusion protein itself,
has been described in copending U.S. Application Serial
No. 07/552,810 filed July 16, 1990, which is hereby
incorporated by reference.

The single copy fusion protein can be formed
with human carbonic anhydrase modified at residue 240 as
the binding protein. The modification of carbonic
anhydrase at residue 240 involves a substitution of a
leucine for a methionine. The interconnecting peptide
is a methionine residue which can be cleaved by cyanogen
bromide. The variable fused polypeptide contains a
single copy of a modified truncated GLP1 peptide having
the following sequence (SEQ ID NO:2):

His-Ala-Glu-Gly-Thr-Phe-Thr-
7
Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-
Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-
Ala-Trp-Leu-Val-Lys-Ala-Phe-Ala

The core GLP1 peptide is truncated from the
native sequence so that it contains residues
corresponding to residues 7-34 of the naturally
occurring sequence. The GLP1 peptide is modified by the
addition of an Ala-Phe-Ala leaving unit at residues 35-37.
This tripeptide is not found in the naturally
occurring sequence and is a good leaving group for
trypsin transpeptidation. Briefly, this single copy
recombinant fusion protein can be produced from a
construct formed as follows. The DNA sequence for the
human carbonic anhydrase II gene is modified so that the
methionine codon at amino acid residue 240 is replaced
with a leucine codon using site directed mutagenesis, as
described in Sambrook et al., Molecular Cloning, A
Laboratory Manual, Cold Spring Harbor (1989).
The modified gene for human carbonic anhydrase
is then cloned into an expression vector which is
compatible with E. coli, such as pB0304, as described in
U.S. application Serial No. 07/552,810. In a
non-essential preferred embodiment, a short DNA fragment
including the codon for methionine is chemically
synthesized and inserted immediately downstream from the
end of the gene for human carbonic anhydrase by standard
methods. A DNA sequence encoding the truncated core
GLP1 (7-34)-Ala-Phe-Ala polypeptide is formed by
automated DNA synthesis and inserted directly downstream
from the interconnecting DNA segment encoding the
methionine codon. The final recombinant expression
vector encoding the single copy fusion protein is
transformed into E. coli by standard methods and the
expressed recombinant single copy fusion protein can be
obtained using affinity chromatography with
sulfanilamide or by other chromatographic methods. Once
the recombinant fusion protein is purified, it can be
cleaved and transpeptidated.
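
The residue numbering used above can be checked with a short sketch. It assumes only the sequence given for SEQ ID NO:2 and the statement that the core begins at native residue 7; the helper name native_numbering is hypothetical and introduced purely for illustration.

```python
# Three-letter residues of the modified truncated GLP1 peptide (SEQ ID NO:2).
SEQ_ID_NO_2 = (
    "His-Ala-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-"
    "Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-Ala-Trp-Leu-Val-Lys-Ala-Phe-Ala"
).split("-")

FIRST_NATIVE_POSITION = 7  # the truncated core starts at native residue 7


def native_numbering(residues, start=FIRST_NATIVE_POSITION):
    """Map native position -> residue for a peptide that begins at `start`."""
    return {start + offset: residue for offset, residue in enumerate(residues)}


numbered = native_numbering(SEQ_ID_NO_2)
core = [numbered[p] for p in range(7, 35)]           # residues 7-34
leaving_unit = [numbered[p] for p in range(35, 38)]  # residues 35-37

print("total residues:", len(SEQ_ID_NO_2))
print("residue 34:", numbered[34])
print("leaving unit:", "-".join(leaving_unit))
```

Under this numbering the core ends in Lys at residue 34, and the Ala-Phe-Ala leaving unit occupies residues 35-37, which is the bond arrangement the transpeptidation step relies on.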

Cleavage and transpeptidation can be conducted
as follows. For example, a 40 mg/ml solution of HCA-
Met-GLP1 (7-34)-Ala-Phe-Ala can be digested with a
50-fold excess of cyanogen bromide (CNBr) in
70% formic acid to release the GLP1 (7-34)-Ala-Phe-Ala
peptide. The reaction mixture can be incubated in the
dark under oxygen-free nitrogen at 20-25°C for 16-24
hours. The reaction mixture is diluted with 15 volumes
of water and freeze dried. For the removal of
acid and by-products, the freeze drying can be repeated
after further addition of water. This cleavage reaction
yields human carbonic anhydrase and the recombinant GLP1
(7-34)-Ala-Phe-Ala polypeptide.
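
The cyanogen bromide step can be pictured with a minimal sketch. CNBr is known to cleave on the carboxyl side of methionine; the carrier string below is a hypothetical, methionine-free placeholder and not the carbonic anhydrase sequence, and the function name cnbr_cleave is an assumption made for illustration.

```python
def cnbr_cleave(sequence):
    """Split a one-letter sequence after every Met (M), as CNBr does.

    Simplification: the Met left at each new C-terminus is kept as "M";
    in the real reaction it is converted to homoserine lactone.
    """
    fragments, current = [], ""
    for residue in sequence:
        current += residue
        if residue == "M":
            fragments.append(current)
            current = ""
    if current:
        fragments.append(current)
    return fragments


CARRIER = "SHHWGYGKHNGPEHW"                   # hypothetical Met-free stand-in
GLP1_AFA = "HAEGTFTSDVSSYLEGQAAKEFIAWLVKAFA"  # SEQ ID NO:2, one-letter code
FUSION = CARRIER + "M" + GLP1_AFA             # carrier-Met-peptide layout

for fragment in cnbr_cleave(FUSION):
    print(fragment)
```

Because the placeholder carrier contains no methionine, the only cut falls at the interconnecting Met and the peptide is released intact. This is presumably also why a carrier used with this cleavage chemistry should not carry methionine at positions where a cut is unwanted.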
The cleaved GLP1 (7-34)-Ala-Phe-Ala polypeptide
can be separated from human carbonic anhydrase by normal
chromatographic methods, i.e., ion exchange, reverse
phase, or size exclusion. In addition, the cleaved
GLP1 (7-34)-Ala-Phe-Ala polypeptide can be separated
from the human carbonic anhydrase by a simple
precipitation procedure. A solution containing carbonic
anhydrase, 70% formic acid, cyanogen bromide,
methionine, and peptide is diluted with water to a
protein concentration of 20 mg/ml while maintaining an
acetic acid concentration of 10%. The addition of 5.6
g/100 ml of sodium sulfate (Na2SO4) to this mixture
results in a precipitate which can be removed by
centrifugation at 10,000 x g for 10 minutes. The
carbonic anhydrase can be quantitatively precipitated
and greater than 80% of the peptide remains in solution.
The supernatant can be applied to an open C-8 column
which is rinsed with four column volumes of 10% acetic
acid. The GLP1 (7-34)-Ala-Phe-Ala can be eluted from
the column with 50% acetonitrile in 10% acetic acid.
The eluted peptide can then
be transpeptidated to yield the modified recombinant
native GLP1 7-36-NH2 amino acid product as follows.
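
The quantities in the precipitation procedure reduce to simple arithmetic, sketched below. The 10 ml starting volume and the 40 mg/ml starting concentration are illustrative assumptions, and both function names are hypothetical; only the 20 mg/ml target and the 5.6 g/100 ml sodium sulfate loading come from the text.

```python
def dilution_volume(initial_volume_ml, initial_mg_per_ml, target_mg_per_ml):
    """Final volume (ml) after diluting to the target protein concentration."""
    if target_mg_per_ml <= 0 or target_mg_per_ml > initial_mg_per_ml:
        raise ValueError("target must be positive and not exceed the start")
    return initial_volume_ml * initial_mg_per_ml / target_mg_per_ml


def sodium_sulfate_grams(volume_ml, grams_per_100_ml=5.6):
    """Mass of sodium sulfate for a given volume at the stated loading."""
    return volume_ml * grams_per_100_ml / 100.0


# Assumed batch: 10 ml of digest at 40 mg/ml protein.
final_volume = dilution_volume(10.0, 40.0, 20.0)
salt = sodium_sulfate_grams(final_volume)
print(f"dilute to {final_volume:.1f} ml, then add {salt:.2f} g sodium sulfate")
```

The same two functions can be reused for any batch size, since both relations are linear in volume.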

The recombinant GLP1 (7-34)-Ala-Phe-Ala
polypeptide can be cleaved with trypsin at the cleavage
site between amino acid residues 34 and 35 at the Lys-Ala
bond in the recombinant polypeptide. It was found that
trypsin did not cleave the Lys-Glu bond at residues 26
and 27 in the GLP1
polypeptide as shown in SEQ ID NO:2. While not in any
way meant to limit the invention, it is believed that
cleavage at residues 26 and 27 by trypsin is not favored
because of the presence of the acidic glutamic acid
residue.

The cleavage with trypsin is conducted in the
presence of either Gly-Arg-NH2 or Gly-Arg-Gly as addition
units so that the cleavage of the Ala-Phe-Ala leaving
unit is followed by the addition of Gly-Arg-NH2 or
Gly-Arg-Gly to the core GLP1 (7-34) polypeptide to yield
either amidated 7-36 GLP1-NH2 polypeptide or GLP1 7-36
peptide with a terminal glycine.

For example, the freeze dried GLP1 (7-34)-
Ala-Phe-Ala is dissolved at 10 mg/ml in a buffer at
pH 5-11 with 0.01 to 1 M Gly-Arg-NH2 or Gly-Arg-Gly
unit which contains bovine trypsin at a 1:100
ratio (trypsin:peptide) at 37°C. The mixture was
stirred using a magnetic stirrer at 1000 rpm. The
trypsin cleaves the Ala-Phe-Ala from the carboxy
terminus of the core and forms an acyl-enzyme
intermediate to residue 34 of the core. The Gly-Arg-NH2
or Gly-Arg-Gly acts as a nucleophile for the
transpeptidation of the acyl-enzyme intermediate. The
first reaction is:

GLP1 (7-34)-Ala-Phe-Ala + Gly-Arg-NH2 --(trypsin)-->
GLP1 (7-36)-NH2

The second reaction is:

GLP1 (7-34)-Ala-Phe-Ala + Gly-Arg-Gly --(trypsin)-->
GLP1 (7-36)-Gly

The production of GLP1 7-36-NH2 or GLP1 7-36-Gly
is monitored by HPLC and the reaction stopped by the
addition of 2 M HCl until the pH is below 3. As
described by Bongers et al., Int. J. Peptide Protein
Res., 40:268 (1992), the GLP1 (7-36)-Gly can be
converted to an amide in a later reaction by use of the
C-terminal α-amidating enzyme as described in Ohsuye et
al., cited supra.
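
The site selection and the exchange of leaving unit for addition unit described in this example can be sketched in a few lines. The model is deliberately crude and rests on two stated assumptions: trypsin acts on the carboxyl side of Lys or Arg, and a Lys or Arg immediately followed by an acidic residue (Asp or Glu) is treated as not cleaved. The function names are hypothetical, and the C-terminal amide is carried only as a text suffix.

```python
ACIDIC = {"D", "E"}


def trypsin_sites(seq):
    """Indices i where the bond after seq[i] is treated as a favored site."""
    return [
        i
        for i, residue in enumerate(seq[:-1])
        if residue in "KR" and seq[i + 1] not in ACIDIC
    ]


def transpeptidate(seq, addition_unit):
    """Swap everything after the last favored site for the addition unit.

    Returns (modified_sequence, leaving_unit).
    """
    sites = trypsin_sites(seq)
    if not sites:
        raise ValueError("no favored trypsin site found")
    cut = sites[-1] + 1
    return seq[:cut] + addition_unit, seq[cut:]


GLP1_7_34_AFA = "HAEGTFTSDVSSYLEGQAAKEFIAWLVKAFA"  # SEQ ID NO:2

product, leaving = transpeptidate(GLP1_7_34_AFA, "GR")
print(product + "-NH2", "released:", leaving)
```

In this model the Lys followed by Glu is skipped and only the Lys-Ala bond is used, so the product string matches the GLP1 (7-36) sequence of SEQ ID NO:1. The sketch does not model hydrolysis competing with the nucleophile or further digestion of the product.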

EXAMPLE 2 

Preparation of Amidated Recombinant GLP1 (7-36)-NH2
From a Multicopy Protein Construct

Amidated recombinant GLP1 7-36-NH2 was prepared
from a multicopy fusion protein containing four copies
of a modified truncated GLP1 peptide having amino acid
residues 1-34 of the native or naturally occurring
polypeptide and the terminal amino acid residues of
Ala-Phe-Ala at residues 35-37.

A DNA construct encoding a multicopy fusion
protein can be prepared as described in Example 1.
Briefly, in a non-essential preferred embodiment, a gene
encoding human carbonic anhydrase is modified so that the
codon for methionine at amino acid residue 240 is
replaced with the codon for leucine and subcloned into a
vector that can be expressed in E. coli, such as pB0304,
as described in U.S. application Serial No. 07/552,810.
The DNA sequence for the interconnecting peptide
encoding a methionine residue and the DNA sequence encoding
four copies of the modified recombinant GLP1 (1-
34)-Ala-Phe-Ala polypeptide can be synthesized by
automated DNA synthesis together with
the DNA sequence encoding the truncated modified GLP1
sequence. This DNA sequence is then inserted
immediately downstream from the gene for human carbonic
anhydrase in the E. coli expression vector by standard
methods. The expression vector encoding the multicopy
fusion protein is then transformed into E. coli.
Transformants are selected and amplified. The multicopy
fusion protein is recovered and purified from cell
lysates as described in Example 1.

Once purified, the multicopy fusion protein is
cleaved with cyanogen bromide as described in Example 1
to yield human carbonic anhydrase and a multicopy
protein containing four copies of the truncated
GLP1 (1-34)-Ala-Phe-Ala polypeptide. The multicopy
peptide can be separated from human carbonic anhydrase
by standard chromatographic methods such as ion
exchange, reverse phase or size exclusion or by the
precipitation method described in Example 1.

The multicopy polypeptide is then cleaved and
transpeptidated with trypsin as follows. Trypsin will
cleave the multicopy polypeptide into single copy
polypeptides between amino acid residues 6 and 7 and
residues 34 and 35 to yield four single copies of GLP1
(7-34) and peptides containing Ala-Phe-Ala connected to
amino acid residues 1-6. When the cleavage is conducted
in the presence of an appropriate nucleophilic addition
unit, such as Gly-Arg-NH2, transpeptidation occurs. For
example, freeze dried multicopy polypeptide is dissolved
at 10 mg/ml in a buffer at pH 5-11 with 0.01 to 1 M
Gly-Arg-NH2 which contains trypsin at a 1:1000 ratio
(trypsin:peptide). The trypsin cleaves the multicopy
peptide as described above to yield GLP1 (7-34) core
polypeptide which forms an acyl-enzyme intermediate with
the trypsin. The Gly-Arg-NH2 acts as a nucleophile and
transpeptidation occurs at amino acid residue 34. The
production of GLP1 (7-36)-NH2 is monitored by HPLC and
the reaction stopped by the addition of HCl when the
reaction has reached completion.
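
The fragment pattern stated above for the four-copy construct can be reproduced with a small digestion sketch. It assumes the same simplified rule as before (cut after Lys or Arg unless the next residue is Asp or Glu) and a head-to-tail arrangement of four GLP1 (1-34)-Ala-Phe-Ala units; the function name digest is hypothetical.

```python
def digest(seq):
    """Cut after every Lys/Arg that is not followed by Asp or Glu."""
    fragments, start = [], 0
    for i in range(len(seq) - 1):
        if seq[i] in "KR" and seq[i + 1] not in "DE":
            fragments.append(seq[start:i + 1])
            start = i + 1
    fragments.append(seq[start:])
    return fragments


N_TERMINAL_1_6 = "HDEFER"                   # native residues 1-6
CORE_7_34 = "HAEGTFTSDVSSYLEGQAAKEFIAWLVK"  # SEQ ID NO:4
UNIT = N_TERMINAL_1_6 + CORE_7_34 + "AFA"   # GLP1 (1-34)-Ala-Phe-Ala
MULTICOPY = UNIT * 4

fragments = digest(MULTICOPY)
print("core copies released:", fragments.count(CORE_7_34))
print("other fragments:", sorted(set(fragments) - {CORE_7_34}))
```

Each released core ends in Lys at residue 34, which is the point at which the acyl-enzyme intermediate would accept the Gly-Arg-NH2 nucleophile; the remaining fragments are the Ala-Phe-Ala units joined to residues 1-6 of the following copy.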

EXAMPLE 3 

Preparation of Amidated Recombinantly Produced
GLP1 (7-36)-NH2 From a Multicopy Polypeptide

Modified recombinant GLP1 (7-36)-NH2 can also be
prepared by cleavage and transpeptidation of a multicopy
polypeptide. The multicopy polypeptide was formed with
four copies of core GLP1 (7-34) connected to a terminal
core GLP1 (7-34) linked to an Ala-Phe-Ala leaving unit.

A DNA construct encoding the recombinant
multicopy polypeptide can be formed as described for a
multicopy or single copy recombinant fusion protein as
described in Examples 1 and 2, but without the carbonic
anhydrase as fusion protein or the methionine codon as
interconnecting peptide. A DNA sequence encoding four
copies of the GLP1 (7-34) core polypeptide and a
terminal GLP1 (7-34)-Ala-Phe-Ala recombinant
polypeptide can be synthesized by automated DNA
synthesis. The DNA sequence is then subcloned into an
expression vector compatible with E. coli and
transformed into E. coli. Transformants expressing the
recombinant multicopy polypeptides were selected and
amplified. It is likely that the recombinant multicopy
polypeptide will be found in inclusion bodies. The
recombinant multicopy polypeptide can be purified from
inclusion bodies as follows.

Cells are lysed with sonication in 50 mM
Tris-HCl (pH 7.9) and 2.5 mM EDTA containing 100 mM NaCl
with 10 micrograms of DNase I. Lysozyme (30 ml) is
added and the lysate is incubated overnight to disrupt
the cell fragments. To purify recombinant polypeptide
from insoluble granules, the lysate is then centrifuged
and the insoluble granules are incubated with sodium
deoxycholate, and washed several times. The inclusion
bodies are then frozen. The thawed inclusion bodies are
further purified by ultrafiltration and DEAE
chromatography after being dissolved in a
chaotropic reagent, such as urea, guanidine, or 50 mM
NaOH.

Once purified, the recombinant multicopy
polypeptide is cleaved and transpeptidated with trypsin.
Trypsin will cleave at the -Lys- at residue 34 to yield
single copies of the core GLP1 (7-34) and a copy of
GLP1 (7-34)-Ala-Phe-Ala. The GLP1 (7-34)-Ala-Phe-Ala
will also be cleaved by trypsin to yield GLP1 (7-34)
core and the leaving unit Ala-Phe-Ala. The trypsin
cleavage of the multicopy polypeptide is conducted in
the presence of a nucleophilic addition unit
Gly-Arg-NH2 so that the final product is a GLP1 (7-36)-NH2
modified recombinant polypeptide as a result of trypsin
catalyzed transpeptidation.

EXAMPLE 4 

Fusion Protein Construct:
A modified recombinant growth hormone releasing
factor can be prepared by cleavage and transpeptidation
of a recombinant multicopy fusion protein. The native
or naturally occurring sequence of growth hormone
releasing factor (SEQ ID NO:16) is:

Tyr-Ala-Asp-Ala-Ile-Phe-Thr-Asn-Ser-Tyr-Arg-Lys-
1
Val-Leu-Gly-Gln-Leu-Ser-Ala-Arg-Lys-Leu-Leu-Gln-
13
Asp-Ile-Met-Ser-Arg-Gln-Gln-Gly-Glu-Ser-Asn-Gln-
25
Glu-Arg-Gly-Ala-Arg-Ala-Arg-Leu-NH2
37

A recombinantly produced growth hormone
releasing factor (GRF) is not produced in the highly
active amidated form and an additional step using an
α-amidating enzyme is typically necessary. However, a
strategy can be designed to form the amidated GRF by
combining cleavage of a recombinant single copy fusion
protein with transpeptidation.

A DNA construct encoding a single copy fusion
protein can be formed as described in Example 1.
Briefly, the gene for human carbonic anhydrase is
subcloned into an E. coli expression vector such as
pB0304, as described in U.S. application Serial No.
07/552,810. The DNA sequence encoding an
interconnecting peptide of the following sequence (SEQ
ID NO: 9):

Asn-Gly-Pro-Arg

is synthesized by automated DNA synthesis. A DNA
sequence encoding a truncated core GRF polypeptide and
the leaving unit -Ala-, for example GRF (1-41)-Ala-Arg-
Leu-Ala, having the following sequence (SEQ ID NO: 17):

Tyr-Ala-Asp-Ala-Ile-Phe-Thr-Asn-Ser-Tyr-Arg-Lys-
1
Val-Leu-Gly-Gln-Leu-Ser-Ala-Arg-Lys-Leu-Leu-Gln-
13
Asp-Ile-Met-Ser-Arg-Gln-Gln-Gly-Glu-Ser-Asn-Gln-
25
Glu-Arg-Gly-Ala-Arg-Ala-Arg-Leu-Ala
37          41          45

is synthesized by automated DNA synthetic methods. The
terminal Ala residue is added because it serves as a
good leaving unit for the cleavage and transpeptidation
reaction. The DNA sequence for the interconnecting
peptide and the truncated modified GRF (1-41)-Ala (SEQ
ID NO: 17) peptide can be synthesized together as a
single sequence or separately and then subcloned
immediately downstream from the gene for human carbonic
anhydrase to form the expression vector for the fusion
protein. The expression vector is then transformed into
E. coli and transformants are selected and amplified.
The fusion protein is isolated and purified from cell
lysates using affinity chromatography as described in
Example 1.

The purified human carbonic anhydrase
fusion protein is digested in
guanidine hydrochloride solution to release the GRF (1-41)-Ala-
Arg-Leu-Ala peptide from the fusion protein. The
cleaved GRF (1-41)-Ala-Arg-Leu-Ala peptide can be
separated from human carbonic anhydrase by normal
chromatographic methods, i.e., ion exchange, reverse
phase and size exclusion. Alternatively, the peptide
can be separated from the carrier protein by dilution of
the reaction mixture with water and acetic acid so that
the concentration of acetic acid is made at 10%
volume/volume (v/v). The addition of 5.6 g/100 ml sodium
sulfate (Na2SO4) to this mixture results in a precipitate
which can be removed by centrifugation at 10,000 x g for
10 minutes. The human carbonic anhydrase is selectively
precipitated from the reaction mixture. The supernatant
is applied to an open C-8 column which is rinsed with
four column volumes of 10% acetic acid and the peptide
is eluted from the column with 50% acetonitrile in 10%
acetic acid. The peptide is then freeze dried.

For cleavage and transpeptidation, the purified
GRF (1-41)-Ala peptide is then cleaved with thrombin in
the presence of either Ala-Arg-Leu-NH2 or Ala-Arg-Leu-Gly
(SEQ ID NO:8). The purified GRF (1-41)-Ala is dissolved
at 10 mg/ml in a buffer at pH 5-11 with 0.01 to 1 M
Ala-Arg-Leu-NH2 or Ala-Arg-Leu-Gly (SEQ ID NO:8) which
contains thrombin at a 1:3000 ratio (thrombin:peptide).
It has been discovered that the GAR sequence at residues
39-41 in the GRF (1-41) peptide (SEQ ID NO:7) is a site
recognized and cleaved by thrombin. The thrombin
cleaves the Ala from the carboxyl terminus and forms an
acyl-enzyme intermediate. The Ala-Arg-Leu-NH2 or Ala-
Arg-Leu-Gly (SEQ ID NO:8) acts as a nucleophile and
transpeptidation occurs as follows:

Reaction 1:

GRF (1-41)-Ala-Arg-Leu-Ala + Ala-Arg-Leu-NH2 --(thrombin, pH 5-11)-->
GRF (1-41)-Ala-Arg-Leu-NH2 + Ala-Arg-Leu-Ala

Reaction 2:

GRF (1-41)-Ala-Arg-Leu-Ala + Ala-Arg-Leu-Gly --(thrombin, pH 5-11)-->
GRF (1-41)-Ala-Arg-Leu-Gly + Ala-Arg-Leu-Ala

The final product of reaction 1 corresponds to
the amidated native GRF (1-44)-NH2. The final product of
reaction 2 corresponds to GRF (1-44)-Gly. The GRF 1-
44-Gly can be converted to the amide by later reaction
using a C-terminal α-amidating enzyme.
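
Reaction 1 can be traced with a short sketch that locates the thrombin recognition sequence and exchanges the C-terminal unit. It assumes thrombin cuts on the carboxyl side of Arg within Gly-Pro-Arg or Gly-Ala-Arg, and it follows the reaction scheme above in treating everything after Arg 41 as the released unit. Function and variable names are hypothetical.

```python
import re

GRF_1_44 = "YADAIFTNSYRKVLGQLSARKLLQDIMSRQQGESNQERGARARL"  # SEQ ID NO:16
SUBSTRATE = GRF_1_44[:41] + "ARLA"  # GRF (1-41)-Ala-Arg-Leu-Ala, SEQ ID NO:17

# Gly-Pro-Arg or Gly-Ala-Arg; the cut is placed after the Arg.
THROMBIN_MOTIF = re.compile(r"G[PA]R")


def thrombin_cut_positions(seq):
    """Number of residues preceding each cut."""
    return [match.end() for match in THROMBIN_MOTIF.finditer(seq)]


def thrombin_transpeptidate(seq, addition_unit):
    """Replace the residues after the last cut with the addition unit."""
    cuts = thrombin_cut_positions(seq)
    if not cuts:
        raise ValueError("no thrombin recognition sequence found")
    return seq[:cuts[-1]] + addition_unit, seq[cuts[-1]:]


product, released = thrombin_transpeptidate(SUBSTRATE, "ARL")
print("cut after residue:", thrombin_cut_positions(SUBSTRATE))
print("matches native GRF (1-44):", product == GRF_1_44)
print("released unit:", released)
```

The only match in the substrate is the Gly-Ala-Arg at residues 39-41, so the acyl-enzyme intermediate forms at Arg 41 and the Ala-Arg-Leu addition unit restores the native 44-residue sequence. The terminal amide itself is not represented in the string.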

EXAMPLE 5 

Preparation of Amidated GRF

Amidated recombinant GRF (1-44)-NH2 can be
prepared from a recombinant multicopy polypeptide by
cleavage and transpeptidation.

The recombinant multicopy peptide is produced
by cells transformed with an expression vector. A DNA
construct is formed by joining four copies of the coding
sequence for a truncated GRF (1-41) joined end to end
and having a terminal DNA sequence encoding a modified
truncated GRF (1-41)-Ala peptide. This DNA construct is
formed by automated DNA synthesis and subcloned into an
E. coli expression vector such as pB0304. The expression
vector is then transformed into E. coli and
transformants are selected and then amplified. The
multicopy polypeptide is isolated from cell lysates as
described in Example 3.

Once purified, the multicopy polypeptide is
cleaved and transpeptidated with thrombin. Thrombin
cleaves after the GAR sequences of residues 39-41 in the
1-41 peptide to yield single copies of truncated GRF
(1-41) and a modified truncated GRF 1-41-Ala. The
modified truncated GRF 1-41-Ala is also cleaved by
thrombin to yield GRF 1-41 and alanine. The cleavage
with thrombin is conducted in the presence of Ala-Arg-
Leu-NH2. The Ala-Arg-Leu-NH2 acts as a nucleophile
resulting in transpeptidation as follows:

GRF (1-41)-Ala-Arg-Leu-Ala + Ala-Arg-Leu-NH2 --(thrombin)-->
GRF 1-44-NH2 + Ala-Arg-Leu-Ala

The final product is amidated native GRF (1-44)-NH2.
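
The multicopy case can be sketched the same way. The construct below follows the description in this example (four GRF (1-41) copies joined end to end, followed by a terminal GRF (1-41)-Ala), and the rule that thrombin cuts after Arg in Gly-Pro-Arg or Gly-Ala-Arg is the same assumption as in the single copy sketch. Names are hypothetical.

```python
import re

GRF_1_44 = "YADAIFTNSYRKVLGQLSARKLLQDIMSRQQGESNQERGARARL"  # SEQ ID NO:16
CORE_1_41 = GRF_1_44[:41]                                   # SEQ ID NO:7
MULTICOPY = CORE_1_41 * 4 + CORE_1_41 + "A"  # terminal copy carries Ala

MOTIF = re.compile(r"G[PA]R")  # cut after the Arg of the motif


def thrombin_digest(seq):
    """Split a sequence after every recognition motif."""
    fragments, start = [], 0
    for match in MOTIF.finditer(seq):
        fragments.append(seq[start:match.end()])
        start = match.end()
    if start < len(seq):
        fragments.append(seq[start:])
    return fragments


fragments = thrombin_digest(MULTICOPY)
cores = [f for f in fragments if f == CORE_1_41]
products = [core + "ARL" for core in cores]  # Ala-Arg-Leu-NH2 addition unit

print("cores released:", len(cores))
print("all products match GRF (1-44):", all(p == GRF_1_44 for p in products))
```

In this idealized picture every core released from the construct can accept one addition unit, so one multicopy molecule can give up to five product molecules. Actual yields would depend on the competition between the nucleophile and water, which the sketch does not model.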

All publications and patent applications in
this specification are indicative of the level of
ordinary skill in the art to which this invention
pertains. All publications and patent applications are
herein incorporated by reference to the same extent as
if each individual publication or patent application was
specifically and individually indicated by reference.

It will be apparent to one of ordinary skill in
the art that many changes and modifications can be made
in the invention without departing from the spirit or
scope of the appended claims.
SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: BioNebraska, Inc. 

(B) STREET: 3820 NW 46th Street

(C) CITY: Lincoln 

(D) STATE: NE 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 68524 

(G) TELEPHONE: 402-470-2100 

(H) TELEFAX: 402-470-2345 

(ii) TITLE OF INVENTION: Enzymatic Method for Modification of 
Recombinant Polypeptides 



(iii) NUMBER OF SEQUENCES: 26 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/095,162 

(B) FILING DATE: 20-JUL-1993 
(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vii) IMMEDIATE SOURCE:

(B) CLONE: GLP1 7-36-NH2 (Glucagon-like Peptide)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15

Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: GLP1 (7-34)-Ala-Phe-Ala

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15

Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Ala Phe Ala
20 25 30 
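
The declared lengths of listing entries such as these can be cross-checked mechanically. The sketch below converts the three-letter entries for SEQ ID NO:1 and SEQ ID NO:2 to one-letter code using the standard amino acid abbreviations and compares the residue counts with the LENGTH fields; the dictionary and function names are hypothetical.

```python
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter):
    """Convert a space-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[residue] for residue in three_letter.split())


LISTING = {
    "SEQ ID NO:1": (30, "His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser "
                        "Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala "
                        "Trp Leu Val Lys Gly Arg"),
    "SEQ ID NO:2": (31, "His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser "
                        "Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala "
                        "Trp Leu Val Lys Ala Phe Ala"),
}

for name, (declared_length, sequence) in LISTING.items():
    one_letter = to_one_letter(sequence)
    status = "ok" if len(one_letter) == declared_length else "MISMATCH"
    print(name, one_letter, len(one_letter), status)
```

A misread residue such as "Gin" or "He" would raise a KeyError here, which makes the same routine useful for spotting transcription errors in a listing.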




(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: GLP1 (7-36)-Gly

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15

Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg Gly
20 25 30



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: GLP1 (7-34)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15

Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys
20 25
(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

His Asp Glu Phe Glu Arg His
1 5



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: GLP1 (1-34)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

His Asp Glu Phe Glu Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val
1 5 10 15

Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu
20 25 30

Val Lys
(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 41 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gln
1 5 10 15

Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly
20 25 30

Glu Ser Asn Gln Glu Arg Gly Ala Arg
35 40



(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:
Ala Arg Leu Gly
(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:
Asn Gly Pro Arg



(2) INFORMATION FOR SEQ ID N0:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Enterokinase cleavage enzyme

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:10: 
GACGACGACG ATAAA 




(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vii) IMMEDIATE SOURCE:

(B) CLONE: Factor Xa cleavage enzyme 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 
ATTGAAGGAA GA 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Thrombin cleavage enzyme 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:
AGAGGACCAA GA 
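
The three DNA entries above (SEQ ID NO:10 to NO:12) can be read as peptide recognition sequences by translating them codon by codon. The sketch below uses only the standard genetic code entries needed for these three sequences, so the table is intentionally partial; the names CODONS and translate are hypothetical.

```python
# Partial standard codon table: only the codons that occur in
# SEQ ID NO:10, NO:11 and NO:12.
CODONS = {
    "GAC": "Asp", "GAT": "Asp", "AAA": "Lys",
    "ATT": "Ile", "GAA": "Glu", "GGA": "Gly",
    "AGA": "Arg", "CCA": "Pro",
}


def translate(dna):
    """Translate a DNA string (spaces ignored) into three-letter residues."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "-".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))


ENTRIES = {
    "SEQ ID NO:10 (enterokinase)": "GACGACGACG ATAAA",
    "SEQ ID NO:11 (factor Xa)": "ATTGAAGGAA GA",
    "SEQ ID NO:12 (thrombin)": "AGAGGACCAA GA",
}

for name, dna in ENTRIES.items():
    print(name, "->", translate(dna))
```

The thrombin entry translates to a sequence ending in Gly-Pro-Arg, which agrees with the recognition sequence recited for thrombin elsewhere in the specification.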
(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vii) IMMEDIATE SOURCE:

(B) CLONE: Ubiquitin cleaving enzyme

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:
AGAGGAGGA



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vii) IMMEDIATE SOURCE:

(B) CLONE: Renin cleavage enzyme

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
CATCCTTTTC ATCTGCTGGT TTAT
(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: GLP1 (1-36)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

His Asp Glu Phe Glu Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val
1 5 10 15

Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu
20 25 30 

Val Lys Gly Arg 
35 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: GRF (1-44) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gln
1 5 10 15

Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly
20 25 30

Glu Ser Asn Gln Glu Arg Gly Ala Arg Ala Arg Leu
35 40
(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 45 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vii) IMMEDIATE SOURCE:

(B) CLONE: GRF (1-41)-Ala-Arg-Leu-Ala

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:

Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gln
1 5 10 15

Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly
20 25 30

Glu Ser Asn Gln Glu Arg Gly Ala Arg Ala Arg Leu Ala
35 40 45


(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
1 5 10 15

Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Xaa
20 25
(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vii) IMMEDIATE SOURCE:

(B) CLONE: GLP1 (1-37)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:

His Asp Glu Phe Glu Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val
1 5 10 15

Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu
20 25 30

Val Lys Gly Arg Gly
35

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Ala Arg Leu Ala
1
(2) INFORMATION FOR SEQ ID NO:21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 45 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vii) IMMEDIATE SOURCE:

(B) CLONE: GRF (1-44)-Gly

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:

Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gln
1 5 10 15

Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly
20 25 30

Glu Ser Asn Gln Glu Arg Gly Ala Arg Ala Arg Leu Gly
35 40 45


(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 



Arg Ala Arg Leu Ala
1 5
(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: GLP1 (1-34)-Ala-Phe-Ala

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

His Asp Glu Phe Glu Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val
1 5 10 15

Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu
20 25 30 

Val Lys Ala Phe Ala 
35 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: GRF (1-41)-Ala 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gln
1 5 10 15

Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly
20 25 30

Glu Ser Asn Gln Glu Arg Gly Ala Arg Ala
35 40
(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 44 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vii) IMMEDIATE SOURCE:

(B) CLONE: GRF (1-41)-Ala-Arg-Leu

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gln
1 5 10 15

Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly
20 25 30

Glu Ser Asn Gln Glu Arg Gly Ala Arg Ala Arg Leu
35 40



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: GRF (1-41) Ala-Arg-Leu-Gly 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gln
1 5 10 15

Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly
20 25 30 

Glu Ser Asn Gin Glu Arg Gly Ala Arg Ala Arg Leu Gly 
35 40 45 
WHAT IS CLAIMED IS:

1. A process for modifying a polypeptide by
transpeptidation comprising: contacting together an
addition unit, an endopeptidase enzyme specific for
an enzyme cleavage site, and the recombinant
polypeptide of at least one leaving unit and a core,
wherein the polypeptide is a recombinant polypeptide and the
leaving unit is linked to the core by the enzyme
cleavage site recognized by the endopeptidase
enzyme, to produce a modified recombinant
polypeptide having the addition unit attached to the
core and substituted for the leaving unit.

2. The process according to claim 1, wherein the 
endopeptidase enzyme is a serine or cysteine 
peptidase. 

3. The process according to claim 1, wherein the 
endopeptidase enzyme is selected from the group 
consisting of trypsin, thrombin, chymotrypsin, 
enterokinase, subtilisin, ficin, papain, and factor
Xa. 

4. The process according to claim 1, wherein the core 
is a truncated version of a natural polypeptide. 

5. The process according to claim 1, wherein the core
is GLP1 (7-34) (SEQ ID NO:4), GLP1 (1-34) (SEQ ID
NO:6), or GRF (1-41) (SEQ ID NO:7).

6. The process according to claim 1, wherein the 
leaving unit of the recombinant polypeptide 
comprises one or more amino acid residues. 

7. The process according to claim 1, wherein the
addition unit of the modified recombinant
polypeptide comprises one or more amino acid
residues which may or may not be altered at the C-
terminal α-carboxy.

8. The process according to claim 1, wherein for the
endopeptidase cleavage enzyme trypsin, the addition
unit is comprised of Gly-Arg-NH2 and Gly-Arg-Gly.

9. The process according to claim 1, wherein for the
endopeptidase cleavage enzyme thrombin, the addition
unit is Ala-Arg-Leu-NH2 and Ala-Arg-Leu-Gly (SEQ ID
NO:8).

10. The process according to claim 1, wherein the 
endopeptidase enzyme trypsin cleaves at a cleavage 
site at the carboxy side of -Lys- and -Arg- bonds. 

11. The process according to claim 1, wherein the -Lys- 
moiety is a poor substrate for the endopeptidase 
trypsin when immediately adjacent to an amino acid 
having a carboxylic acid containing side chain. 

12. The process according to claim 1, wherein the 
endopeptidase enzyme thrombin operates on a cleavage 
site for -Arg- within a recognition sequence 
Gly-Pro-Arg. 

13. The process according to claim 1, wherein the 
endopeptidase enzyme thrombin operates on a cleavage 
site for -Arg- within a recognition sequence 
Gly-Ala-Arg. 

14. The process according to claim 1, wherein the 
transpeptidation reaction occurs in a buffered 
solution at a pH of about 5 to about 11. 

15. The process according to claim 1, wherein the
transpeptidation reaction occurs with an
endopeptidase cleavage enzyme:recombinant
polypeptide molar ratio of about 1:1000 to 1:10,000.

16. The process according to claim 1, wherein the
transpeptidation reaction utilizing the trypsin
endopeptidase occurs with a cleavage
enzyme:recombinant polypeptide molar ratio of about
1:1000 to 1:5000.

17. The process according to claim 1, wherein the 
transpeptidation reaction utilizing the enzyme 
thrombin occurs with a cleavage enzyme:recombinant
polypeptide molar ratio of about 1:1000 to 1:10,000. 

18. A process for cleavage of a polypeptide at the 
carboxy terminus of the amino acid -Arg-, wherein 
the endopeptidase enzyme thrombin cleaves at the 
carboxy terminus of the amino acid -Arg- within a 
cleavage recognition sequence of Gly-Ala-Arg. 

19. A process for modifying a recombinant polypeptide by 
transpeptidation comprising: 

(a) forming a recombinant polypeptide of a core 
and at least one leaving unit wherein the leaving 
unit is linked to the core by an enzyme cleavage 
site; and 

(b) contacting an addition unit and the 
recombinant polypeptide with an endopeptidase enzyme 
specific for the cleavage site to produce a modified 
recombinant polypeptide having the addition unit 
attached to the core and substituted for the leaving 
unit. 

20. A process for modifying a recombinant polypeptide by 
transpeptidation comprising: contacting together 
addition units, an endopeptidase enzyme specific for 
an enzyme cleavage site, a multicopy recombinant 
polypeptide of two or more cores, and a leaving unit
linked to a terminal core, wherein each link is an
enzyme cleavage site recognized by the endopeptidase 
enzyme, and transpeptidation occurs simultaneously 
with cleavage of the multicopy recombinant 
polypeptide into individual core units each with an 
addition unit linked to its downstream end to 
produce a modified recombinant polypeptide product. 

21. The process according to claim 20, wherein the core 
units are linked through an intraconnecting peptide 
by cleavage sites operated on by the endopeptidase 
enzyme.

22. A polypeptide having an amino acid sequence (SEQ ID
NO:18):

His-Ala-Glu-Gly-Thr-Phe-Thr-
 7
Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-
Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-
                26  27
Ala-Trp-Leu-Val-Lys-X
                34

wherein X is selected from the group consisting of 

(a) Gly-Arg-NH2;

(b) Gly-Arg-Gly; and 

(c) Gly-Arg-Gly-NH2,

which is produced by the process according to claim 
1. 

23. A polypeptide comprising multiple copies of 
contiguously linked GLP1 (1-34) (SEQ ID NO:6),
wherein the terminal copy is linked to a leaving 
unit. 



24. A polypeptide comprising multiple copies of 
contiguously linked GLP1 (7-34) (SEQ ID NO:4),
wherein the terminal copy is linked to a leaving
unit.

25. A polypeptide comprising multiple copies of 
contiguously linked GRF 1-41 (SEQ ID NO:7), wherein
the terminal copy is linked to a leaving unit. 

26. An expression vector containing a DNA sequence 
coding for a polypeptide of at least one leaving 
unit and a core wherein the leaving unit is linked 
to the core by an enzyme cleavage site which is 
recognized by an endopeptidase enzyme, said enzyme 
being capable of causing the substitution of the 
addition unit for the leaving unit. 

27. A recombinant gene containing a DNA sequence coding 
for a polypeptide of at least one leaving unit and a 
core wherein the leaving unit is linked to the core 
by an enzyme cleavage site which is recognized by an 
endopeptidase enzyme, the enzyme being capable of 
causing the substitution of an addition unit for the 
leaving unit. 

28. A transformed cell expressing the recombinant gene 
containing a DNA sequence coding for a polypeptide 
of at least one leaving unit and a core wherein the 
leaving unit is linked to the core by an enzyme 
cleavage site which is recognized by an 
endopeptidase enzyme, said enzyme being capable of 
causing the substitution of an addition unit for the 
leaving unit. 



29. The transformed cell according to claim 28, wherein 
the transformed cell comprises a prokaryotic cell. 

30. The transformed cell according to claim 29, wherein
the prokaryotic cell is an E. coli.

31. The transformed cell according to claim 28, wherein 
the transformed cell comprises a eukaryotic cell. 

32. A pharmaceutical composition of an effective amount
of GRF (1-44)-NH2 (SEQ ID NO:16) in a physiologically
acceptable carrier to treat short stature syndrome,
endometriosis and osteoporosis, wherein the
GRF (1-44)-NH2 (SEQ ID NO:16) is produced by the
process according to claim 1.

33. A pharmaceutical composition of an effective amount
of GLP1 (7-36)-NH2 (SEQ ID NO:1) in a physiologically
acceptable carrier to treat diabetes mellitus type
II, wherein the GLP1 (7-36)-NH2 is produced by the
process according to claim 1.

34. A pharmaceutical composition of an effective amount
of GLP1 (7-36)-Gly (SEQ ID NO:3) in a physiologically
acceptable carrier to treat diabetes mellitus type
II, wherein the GLP1 (7-36)-Gly (SEQ ID NO:3) is
produced by the process according to claim 1.

35. A pharmaceutical composition of an effective amount
of GLP1 (7-36)-Gly-NH2 (SEQ ID NO:3) in a
physiologically acceptable carrier to treat diabetes
mellitus type II, wherein the GLP1 (7-36)-Gly-NH2
(SEQ ID NO:3) is produced by the process according
to claim 1.

36. A method for treating diabetes mellitus type II 
comprising: administering an effective amount of the 
modified recombinant polypeptide GRF (1-44)-NH2 (SEQ
ID NO:16) produced by the process of claim 1.

37. A method for treating a human or animal comprising:
administering an effective amount of the modified
recombinant polypeptide GLP1 (7-36)-NH2 (SEQ ID NO:1)
produced by the process of claim 1.

38. A process for modifying a recombinant polypeptide by 
sequential addition by:

(a) contacting together an endopeptidase
enzyme specific for an enzyme cleavage site,
and a recombinant polypeptide of at least one
leaving unit and a core, forming the hydrolyzed
product; and

(b) contacting the hydrolyzed product with an 
endopeptidase enzyme specific for an enzyme cleavage 
site, and an addition unit to produce a modified 
recombinant polypeptide having the addition unit 
attached to the core and substituted at the hydroxy 
group.
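
By way of illustration only, and forming no part of the claims, the trypsin cleavage rules recited in claims 10 and 11 can be sketched against the core sequence of claim 22. The function name `trypsin_sites`, the simple one-residue adjacency test, and the two-residue leaving unit `Ala-Phe` are hypothetical assumptions introduced for the sketch and do not appear in the specification. The sketch marks each Lys or Arg bond and flags a Lys as a poor substrate when an immediately adjacent residue carries a carboxylic acid side chain.

```python
# Illustrative sketch only. The adjacency rule below is a simplified reading of
# claims 10 and 11, not a kinetic model of trypsin.
ACIDIC = {"Asp", "Glu"}  # residues with a carboxylic acid containing side chain

# Core sequence of claim 22 (positions 7 to 34), written as three-letter codes.
CORE = (
    "His-Ala-Glu-Gly-Thr-Phe-Thr-"
    "Ser-Asp-Val-Ser-Ser-Tyr-Leu-Glu-"
    "Gly-Gln-Ala-Ala-Lys-Glu-Phe-Ile-"
    "Ala-Trp-Leu-Val-Lys"
).split("-")

# Hypothetical leaving unit, chosen only so that Lys34 has a bond on its carboxy side.
LEAVING = ["Ala", "Phe"]


def trypsin_sites(residues):
    """Return (index, residue, favored) for each Lys/Arg with a carboxy-side bond.

    index is the 0-based position in `residues`. favored is False when the
    residue is a Lys immediately adjacent to an acidic residue (claim 11).
    """
    sites = []
    for i, res in enumerate(residues):
        if res not in ("Lys", "Arg"):
            continue
        if i == len(residues) - 1:
            continue  # last residue has no peptide bond on its carboxy side
        neighbors = residues[max(i - 1, 0):i] + residues[i + 1:i + 2]
        poor = res == "Lys" and any(n in ACIDIC for n in neighbors)
        sites.append((i, res, not poor))
    return sites


if __name__ == "__main__":
    for index, res, favored in trypsin_sites(CORE + LEAVING):
        label = "favored" if favored else "poor substrate"
        # Add 7 because the core of claim 22 begins at position 7.
        print(f"{res}{index + 7}: {label}")
```

Under these assumptions the sketch flags Lys26, which is followed by Glu27, as a poor substrate and Lys34 as the favored site, which is consistent with the positions numbered in claim 22.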



